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Hepatitis C Virus NS5B Cos^ositions ,y > 
and Methods o£ Use Thereof 

Field of the Invention 

5 The present invention relates to hepatitxs C virus' 

(HCV) , and more specifically, to the HCV. NS5B gene and ? • 
the encoded protein. In particular, the: invention . ; ^ 
relates to novel compositions comprising HCV NS5B . 'jv.V '; 
sequences, functional HCV NS5B sequences, functionally . 
10 improved conpositions of HCV NS5B sequences, and to the 
use of such NS5B sequences in research,' -Idi agnostic, ['^'-^j- 
therapeutic and pharmaceutical applications; y*- : 



Cross Reference to Related J^plicationis /• * ' 



15 This application claims priority under 35 U.S.C. 

§119 (e) to US Provisional Application 60/080,509 filed'. 
April 2, 1998 and US Provisional Application 60/090,356 
filed June 23, 1998, the disclosures of each being 
incorporated by reference as though set forth herein in 

20 full. 

Background of the Invention 

Hepatitis C virus (HCV) is a major cause of 
hepatitis globally. The World Health Organization 
25 estimates that 150 million people worldwide are 

presently infected with the virus. Most infections 
become persistent and about 60% of cases develop into /. 
' chronic liver disease . Chronic HCV infection can lead . 
to develojpment of cirrhosis, hepatocellular carcinoma 
30 and liver failure. 

For treatment of hepatitis due to HCV, interferon' 
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alpha (IFN) is currently approved in the U.S. IFN 
treatment is associated with inproved serum enzyme 
response in 20-40% of patients. The remainder are 
nonresponsive to IFN treatment. For responders, a 
5 sustained inprovement of aminotransferase levels is seen 
in only 10-20% of patients; the majority of patients 
relapse upon cessation of IFN treatment. While IFN 
represents the first treatment of chronic hepatitis C, 
its effectiveness is variable, its cure rate is low, and 
10 associated adverse effects are considerable. 

Vaccines under development for HCV generally 
consist of recombinant versions of the putative viral 
structural proteins (C, El, E2) , or genes encoding 
these. It is believed that virus neutralizing 
15 antibodies do exist, can be elicited, and may be able to 
inhibit or prevent HCV infection. However, to date, no 
vaccine has been demonstrated safe and effective for 
HCV. Indeed, given the inherent genetic diversity of 
HCV, with virus isolates exhibiting immunologically 
20 distinct envelope proteins that are not neutralized by 
pre-existing antibodies, vaccine development will be a 
formidable task. 

HCV can not yet be efficiently propagated in vitro. 
This deficiency has compromised a clear understanding of 
25 several aspects of HCV replication- While numerous HCV 
isolates have been molecularly cloned from infected 
patients and have been secjuenced, much remains to be 
learned regarding the required sequence features and 
elements which facilitate efficient replication. From 
30 study of these sequences in a variety of surrogate gene 
expression systems, knowledge of the molecular biology 
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of HCV has expanded greatly in recent years. These 
advancements, which have been facilitated by existing 
knowledge and iinderstanding of the molecular biology of 
the related pestiviruses and f laviviruses, have yielded 
5 itrqportcmt insights into the virus-specified conponents 
essential for virus replication that may be suitable . 
targets for antiviral drug discovery strategies. 

As mentioned above,, HCV is a genetically 
heterogeneous virus. It exists in nature, and in 

10 infected individuals, as a "quasispecies" , which means 
that within the virus population from any particular 
source, the viral genetic material is present as a 
collection of closely related, but non-identical 
sequences. This genetic heterogeneity of HCV sequences 

15 may be a consequence of natural errors introduced during 
the normal replication process of HCV, of selection 
pressures placed on the virus by the host, or others 
factors. It is believed that the quasispecies nature of 
HCV may contribute to the virus's ability to avoid 

20 elimination by the host's defense systems and establish 
persistent infections. 

The quasispecies nature of HCV has made it 
difficult to define the sequence of "functional" genes 
arid genomes, particularly in the absence of a controlled 

25 system for the efficient replication of the virus. 

However, functionality of segments of HCV sequences has 
been studied in a variety of surrogate expression 
systems . Several gene products have been shown to be 
functional in in vitro assays in that they exhibit 

30 enzymological activities that are predicted to exist 

based on the presence of particular amino acid secjuence 
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motifs in these gene products. For exanple, the NS3 
protein possesses serine proteinase, nucleoside 
triphosphatase and helicase activities and the NS5B 
protein possesses RNA-dependent RNA polymerase (RdRp) 
5 activity. 

The recerit demonstration of an infectious molecular 
clone of HCV [Kolykhalov et al.. Science 277:570-574 
(1997); Yanagi et al., Proc. Natl. Acad. Sci. USA 
94:8738-8743 (1997)1 allows definition of a complete 

10 functional HCV genome and consequently of a sequence for 
a set of functional HCV gene products from this 
particular strain of HCV obtained from one particular 
source . It is noteworthy that Kolykhalov et al . 
indicate that the difficulties in obtaining a functional 

15 genome included: 1) the highly variable quasispecies 

nature of HCV in infected patients; 2) the required in 
vitro amplification (cDNA synthesis and PCR 
amplification) of the sequences due to the small 
quantities of viral RNA in clinical samples; and 3) the 

20 necessity of molecular cloning (in E. coli) - Each of 
these aspects allows for the possibility and 
presentation of defective (poorly functional or 
nonfiinctional) HCV sequences or for errors in 
transcribing or amplifying HCV sequences. Kolykhalov et 

25 al. indicated that RNAs produced from 34 full genome 
molecular clones failed to exhibit infectivity in 
chimpanzees. Nucleotide sequence eunalyses showed 
numerous sequence changes among the 6 full length clones 
that were completely sequenced. Only after a "consensus 

30 sequence" clone was generated, was positive infectivity 
demonstrable. Yanagi et al. also indicated the high 
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degree of sequence variability among individual 
molecular clones obtained from a single source material 
and the non- infectious nature .'of RNA transcripts from 
full-length molecular clones, Yanagi et al. concluded 
5 that a large proportion of HCV genomes are defective. 
In fact, recent work indicates ^ bias toward the 
preferential selection of defective HCV genomes during 
molecular cloning procedures [Foms, et al., Proc Nat;l 
Acad Sci USA 94:13909-13914 (1997)1. Of 25 random 
10 clones derived from a control plasmid by Foms et al . , 
only 8% were functional in polyprotein synthesis. 

Thus, it is clear from these examples that mere' ^ 
knowledge of a sequence derived from HCV is insufficient 
.. to conclude that such a sequence encodes useful viral . 
15 genes or functional viral gene products. 

Regarding the NS5B gene, while many partial and 
coTtplete gene sequences have been entered into various 
datcQ^ases, only a very few of these sequences have been 
demonstrated to be functional based on in vitro assays 
20 of RdRp activity. 

The present invention provides novel and 
functional, as well as f\inctionally improved, hepatitis 
C virus NS5B sequences for use in research, diagnostic, 
therapeutic and pharmaceutical applications, and for use 
25 in assays for the identification of efficacious 
antiviral agents. 

SUMMARY OF THE INVENTION 

30 The present invention provides novel HCV NS5B 

nucleotide sequences from which recombinant HCV proteins 
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having demonstrable RdRp activity may be derived. 
Sequence modifications that result in improved 
functional activity eire also provided- Such' functional 
recombincuit HCV NS5B proteins have utility in research, 
5 diagnostic, therapeutic and pharmaceutical applications. 
In particular,^ the recombinant HCV NS5B proteins of the 
invention have utility in antiviral drug discovery., 
strategies. .'f-;!' • :.• o/* / /^-^ 

In a preferred embodiment of the invention, ;an 

10 isolated nucleic acid molecule is provided that 

comprises a DNA sequence - identified as SEQ ID NO:. .1, . , 
which is present in clone' 4 of the present invention. 
An exemplary HCV NS5B » protein lias the amino acid ■ {• 
sequence Identified as . SEQ ID NO: 2 encoded by clone 4. 

15 Additional nucleic acid molecules that represent nucleic 
acid sequences related to those of SEQ ID NO: 1, 
including but not limited to those identified by SEQ ID 
NO: 3, 4, 5, 6 coid 7 encoding, respectively, the amino^ 
acid seque^nces identified by SEQ ID NO: 8, 9, 10, 11 and 

20 12 of the HCV NS5B are also contemplated to be within 
the scope of the present invention. It is further 
contemplated that conservative sequence or residue 
substitutions of these sequences are also within the 
scope of the invention. 

25 In an additional embodiment of the invention, 

functionally improved hepatitis C polymerase nucleic 
acid and amino acid sequences are provided. Exemplary 
sequences include those nucleic acid sequences that 
encode amino acid sequences related to those identified 

30 ' ■ 
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by SEQ ID NO: 2 in which particular codons have been 
substituted to encode particular amino acid changes in 
the sequence. Bxariples of useful chcuiges include but 
are not limited to changes at amino acid positions 
5 nuihber 75, number 177 and number 543 of . the mature NS5B 
protein. 

In another embodiment of the invention, 
functionally improved hepatitis C polymerasjB nucleic 
acid and amino acid sequences are provided in which 
10 particular codons are substituted to encode particular 
amino acids not normally fottnd in known sequences of 
NS5B . For example, ^ chcinges including but ; not limited to 
residue changes at amino acid positions riu]^ 1 and • 
number 2 of the mature NS5B protein are useful according 
15 to this invention. 

As described above, nucleic acids encoding variant 
proteins or polypeptides are contemplated to be within 
the scope of the present invention. Sequence ID NO: 13 
provides an example of such a variant secjuence. Such 
20 variants may or may not possess HCV polymerase activity. 
These variants may possess one or more changes each of 
which may include one or more additions, deletions, or 
substitutions of amino acid residues. Preferably, the 
changes will not affect, or substantially affect, the 
25 structure or useful properties of the polypeptide. 

Thus, HCV NS5B variants may suitably possess functional 
NS5B activity such as those described above, or they may 
be poorly functional or, inactive, yet contain 
substantially the secondary and tertiary structure of 
30 the native polypeptide. Such NS5B molecules may be used 
to advantage to identify agents that specifically bind 
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to or otherwise affect the HCV NS5B activity. HCV NS5B 
variants can be either naturally occurring (i.e., 
purified or isolated from a natural source) or synthetic 
(i.e., generated by biological expression of DNA that 
5 has been subjected to site-directed mutagenesis or 

produced by chemical synthetic techniques well known in 
the art) . 

The nucleic acid molecules of the invention may be 
cloned cuid expressed in vectors. Such vectors may be in 

10 the form of, for example, a plasmid, a replication 
competent or defective virus or phage vector or a 
replicon provided typically with an origin of 
replication, optionally a promoter for the expression of 
the polynucleotide and optionally a regulator of the 

15 promoter. The vector may contain one or more selectable 
marker genes, for example an anpicillin resistance gene 
in the case of a bacterial plasmid or a neomycin 
resistance gene for a mammalian vector. The vector may 
be used in vitro, for example for the production of RNA 

20 or protein. The vector may be further used to 

transform, transfect, infect or transduce a host cell or 
an organism. The present invention further contemplates 
the use of host cells and organisms harboring or 
expressing the HCV nucleic acid sequences or 

25 polypeptides of the invention for the identification of 
agents that affect the activity of the HCV NS5B protein . 

In yet another embodiment of the invention, methods 
are provided for the identification of agents that 
affect the HCV NS5B polymerase sequences. Such methods 

30 include high throughput screening procedures that allow 
assessment of large numbers of agents. The agents 
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identified by use of the HCV NS5B nucleic acids and 
polypeptides of the invention, variants thereof, or 
methods of the invention, may be either antagonistic or 
agonistic in their affect on the NS5B sequences. These 
5 agents may include molecules of any number of classes 

including but not limited to small molecules, polymers, 
peptides, polypeptides, imnronoglobulins or fragments 
thereof, oligonucleotides, antisense molecules, , 
peptide-nucleic acid conjugates, ribozymes, 

10 polynucleotides and the like. It is specifically 
contemplated that both antagonistic and agonistic 
molecules identified by practice of the invention have 
broad and multiple utilities. Such utilities for 
antagonists of HCV NS5B activity include, but are not 

15 limited to, uses for the inhibition of HCV replication 

in humans, in other living hosts and in in vitro systems 
such as cell, tissue and organ cultures. Agonists of 
HCV NS5B activity identified by practice of the 
invention will also have multiple utilities, both in 

20 living hosts and in in vitro systems. For example, such 
agents will be useful in the development of auiimal 
models of HCV infection, replication or disease and for 
the propagation of HCV in a living host or in cell, 
tissue or organ culture systems - 

25 According to another aspect of the invention, kits 

are provided to facilitate the use of the compositions 
and methods disclosed herein. Exemplary kits would 
include HCV NS5B nucleic acids and polypeptides of the 
invention, variants thereof, alone or in association 

30 with suitcible vectors. Also included would be protocols 
for use of the compositions of the invention for the 
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particular application and the necessary reagents to 
carry out the application. The reagents of a kit may 
vary depending on the intended application. Such 
reagents may include, but are not limited to buffers, 
5 solvents, media and solutions, sxibstrates and cof actors, 
vectors and host cells, and detection or reporter 
reagents. Other accessories may also be included such 
as vials, vessels euid reaction chambers. 

The following definitions are provided to aid in 
10 understanding the siabject matter regarded as the 
invention. 

As used herein, "hepatitis C virus" or "HCV" shall 
mean any representative of a diverse group of related 
viruses classified within the hepacivirus genus of the 

15 Flaviviridae family. 

"Nucleic acid" or a "nucleic acid molecule" as used 
herein refers to any DNA or RNA molecule, either single 
•or double stranded euid, if single stranded, the molecule 
of its complementary sequence in either linear or 

20 circular form. In discussing nucleic acid molecules, a 
sequence or structure of a particular nucleic acid 
molecule may be described herein according to the normal 
convention of providing the sequence in the 5 • to 3 ' 
direction. With reference to nucleic acids of the 

25 invention, the term "isolated nucleic acid" is sometimes 
used. This term, when applied to DNA, refers to a DNA 
molecule that is separated from sequences with which it 
is immediately contiguous in the naturally occurring 
genome of the organism in which it originated. For 

30 example, an "isolated nucleic acid" may comprise a DNA 
molecule inserted into a vector, such as a plasmid or 
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virus vector, or . integrated into the genomic DNA of a 
prokaryotic or eukaryotic cell or host orgemism. 

When applied to RNA, the term "isolated nucleic 
acid" refers primarily to ain RNA molecule encoded by an 
5 isolated DNA molecule as defined above. Alternatively, 
the term may refer to an RNA molecule that has been 
sufficiently sejpoirated from other nucleic acids with 
which it would be associated . in its natural state (i.e., 
in cells or tissues) . An isolated nucleic acid (either 
10 DNA or RNA) may further represent a molecule produced 

directly by biological or synthetic means and separated 
from other cpnponents. present during its production. 

"Natiiral allelic 'variants", "mutants" and 
"derivatives" of particular sequences of nucleic acids 
15 refer to nucleic acid sequences that are closely related 
to a particular sequence but which may possess, either 
naturally or by design, changes in sequence or 
structure. By closely related, it is meant that at 
least about 75%, but often, more than 90%, of the 
20 nucleotides of the sequence match over the defined 

length of the nucleic acid sequence referred to using a 
specific SEQ ID NO. Changes or differences in 
nucleotide secjuence between closely related nucleic acid 
sequences may represent nucleotide changes in the 
25 sequence that arise during the course of normal 

replication or duplication in nature of the particular 
nucleic acid sequence. Other changes may be 
specifically designed and introduced into the sequence 
for specific purposes, such as to change, an amino acid 
30 codon or sequence in a regulatory region of the nucleic, 
acid. Such specific changes may be made in vitro using 
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a variety of mutagenesis techniques or produced in a 
host organism placed under particular selection 
conditions that induce or select for the changes. Such 
secpience variants generated specifically may be referred 

5 to as "mutants" or "derivatives" of the original • !\ 

sequence. ' . . 

The terms "percent similarity", "percent identity"., 
and "percent homology" . when referring to a particular 
sequence are used as set forth in the University of 

10 Wisconsin GCG software program/ 

The term "NS5B" refers to a portion of the HCV 
genome located near the 3' end of the viral genome that 
specifies ..the .region encoding a protein, termed the ./r;':y 
"NS5B protein", "NS5B polypeptide", "NS5B polymerase" .or, 

15 combinations of these terms which are used 

interchcuigeably herein. NS5B in its natural state, 
functions as an RNA-dependent RNA polymerase (RdRp) . 
The nucleic acid region encoding the NS5B protein may 
also be referred to as the "NS5B gene". Thus, the term 

20 "NS5B" may refer to either a nucleic acid encoding the 
NS5B polypeptide, to an NS5B gene or to an NS5B 
polypeptide, or to any portions thereof, depending on 
the context in which the term is used. NS5B may further 
refer to natxiral allelic variants, mutants and 

25 derivatives of either NS5B nucleic acid sequences or 

NS5B polypeptides. The NS5B nucleic acid, NS5B gene or 
NS5B protein referred to may be either fxxnctional or 
non-f unctional . 

The present invention also includes active 

30 portions, fragments, derivatives and f\inctional or 
non- functional mimetics of HCV NS5B polypeptides or 



12 



wo 99/51781 PCTAJS99/07404 

proteins of the invention. An "active portion" of HCV 
NS5B polypeptide means a peptide that is less than the 
full length HCV NS5B polypeptide, but which retains 
measurable biological activity. 
5 A "fragment" or "portion" of the HCV NS5B 

polypeptide means a stretch of amino acid residues of at 

least about five to seven contiguous amino acids, often 

It 

at least about seven to nine Contiguous amino acids, 
typically at least about nine to thirteen contiguous 

10 amino acids and, most preferably, at least about twenty 
to thirty or more contiguous amino acids. . Fragments of 
the HCV NS5B polypeptide sequence, antigenic 
determinants, ; viral antigens or epitopes are useful for., 
eliciting iraonnune responses to a portion of the HCV NS5B 

15 amino acid sequence. 

A "derivative" of the HCV NS5B polypeptide or a 
fragment thereof means a polypeptide modified by varying 
the amino acid sequence of the protein, e.g. by 
manipulation of the nucleic acid encoding the protein or 

20 by altering the protein itself. Such derivatives of the 
natural amino acid sequence may involve insertion, 
addition, deletion or substitution of one or more amino 
acids, and may or may not alter the essential activity 
of original the HCV NS5B polypeptide. As 

25 mentioned above, the HCV NS5B polypeptide or protein of 
the invention includes any analogue, fragment, 
derivative or mutant which is derived from a HCV NS5B 
polypeptide and which retains at least one property or 
other characteristic of the HCV NS5B polypeptide. • 

30 Different "variants" of the HCV NS5B polypeptide exist 
in nature. These variants may be alleles characterized 
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by differences in the nucleotide sequences of the gene 
coding for the protein, or may involve different RNA 
processing or post-translational modifications. The 
skilled person can produce variants having single or 

5 multiple amino acid substitutions, deletions, additions 
or replacements- These varisoits may include inter alia: 
(a) variants in which one or more amino acids residues 
are substituted with conservative or non-conseirvative 
amino acids, (b) varicuits in which one or more amino 

10 acids are added to the HCV NS5B polypeptide, (c) 

variamts in which one or more amino acids include a 
substituent group, and (d) variants in which the HCV 
NS5B polypeptide is fused with another peptide or 
polypeptide such as a fusion partner, a protein tag or 

15 other chemical moiety, that may confer useful properties 
to the HCV NS5B polypeptide, such as, for example, an 
epitope for an antibody, a polyhistidine sequence, a 
biotin moiety and the like. Other HCV NS5B polypeptides 
of the invention include variants in which amino acid 

20 residues from one species are substituted for the 

corresponding residue in another species, either at the 
conserved or non-conserved positions. In another 
embodiment, amino acid residues at non- conserved 
positions are substituted with conservative or 

25 non- conservative residues. The techniques for obtaining 
these variants, including genetic (suppressions, 
deletions, mutations, etc.), chemical, and enzymatic 
techniques are known to the person having ordinary skill 
in the art. 

30 To the extent such allelic variations, analogues, 

fragments, derivatives, mutants, and modifications, 
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including alternative nucleic acid processing forms and 
alternative post-translational modification forms result 
in derivatives of the HCV NS5B polypeptide that retain 
any of the biological properties of the HCV NS5B 
5 polypeptide, they are included within the scope of this 
invention. 

The term "functional" as used herein implies that 
the nucleic or amino acid, sequence is functional for the 
recited assay or purpose. 

10 The phrase "consisting essentially of" when 

referring to a particular nucleotide or amino acid 
means a sequence having the properties of a given SEQ ID 
No:. For exaiqple, when used in reference to an amino 
acid sequence, the phrase includes the secpience per se 

15 and molecular modifications that would not affect the 
basic and novel characteristics of the sequence. 

A "replicon" is any genetic element, for example, a 
plasmid, cosmid, bacmid, phage or virus, that is capable 
of replication largely under its own control. A replicon 

20 may be either RNA or DNA and may be single or double 
stranded . 

A "vector" is a replicon, such as a plasmid, 
cosmid, bacmid, phage or virus, to which another genetic 
sequence or element (either DNA or RNA) may be attached 

25 so as to bring about the replication of the attached 
sequence or element. 

An "expression operon" refers to a nucleic acid 
segment that may possess transcriptional and 
translational control sequences, such as promoters, 

30 enhancers, translational start signals (e.g., ATG or AUG 
codons) , polyadenylation signals, terminators, and the 
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like, and which facilitate the ejqpression of a 
polypeptide coding sequence in a host cell or organism , 
The term "oligonucleotide," as used herein refers 
to primers and probes of the present invention, and is 
5 defined as a nucleic acid molecule comprised of two or 

more ribo- or .deoxyribonucleotides, preferably more than 
three. The exact size of the oligonucleotide will 
depend on various factors and on the particular 
application and use of the oligonucleotide - 

10 The term "probe" as used herein refers to an 

oligonucleotide, polynucleotide or nucleic acid, either 
RNA or DNA, whether occurring naturally as in a purified 
restriction enzyme digest or produced synthetically, 
which is capeible of annealing with or specifically 

15 hybridizing to a nucleic acid with sequences 

conplementairy to the probe. A probe may be either 
single-stranded or double -stranded. The exact length of 
the probe will depend upon many factors, including 
temperature, source of probe and use of the method. For 

20 example, for diagnostic applicatioYis, depending on the 
complexity of the target sequence, the oligonucleotide 
probe typically contains 15-25 or more nucleotides, 
although it may contain fewer nucleotides. The probes 
herein are selected to be "siibstantially" complementary 

25 to different strands of a particular target nucleic acid 
sequence. This means that the probes must be 
sufficiently complementary so as to be able to 
"specifically hybridize" or anneal with their respective 
target strands under a set of pre -determined conditions. 

30 Therefore, the probe sequence need not reflect the exact 
complementary secjuence of the target. For example, a 
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non- complementary nucleotide fragment may be attached to 

the 5' or 3' end of the probe, with the remainder of the 

probe sequence being conplementary to the target strand. 

Alternatively, non-con?)lementary bases or longer 

5 sequences can be interspersed into the probe, provided 

that the probe, sequence has sufficient complementarity 

with the sequence of the target nucleic acid to cuineal 

therewith specfically. 

The term "specifically hybridize" refers to the 

10 association between two single -stranded nucleic acid 

molecules of sufficiently complementary sequence to 

permit such hybridization under pre-determined 
^ ■ ... ' • 

conditions generally used in the art (sometimes termed 

"substantially complementary") . In particular, the tenn 

15 refers to hybridization of an oligonucleotide with a 

substantially complementary sequence contained within a 
single-stranded DNA or RNA molecule of the invention, to 
the substantial exclusion of hybridization of the 
oligonucleotide with single-stranded nucleic acids of 

20 non- complementary sequence. 

The term "primer" as used herein refers to eui 
oligonucleotide, either RNA or DNA, either 
single-stranded or double -stranded, either derived from 
a biological system, generated by restriction enzyme 

25 digestion, or produced synthetically which, when placed 
in the proper environment, is able to functionally act 
as an initiator of template-dependent nucleic acid 
synthesis. When presented with an appropriate nucleic 
acid tenplate, suitable nucleoside triphosphate 

30 precursors of nucleic acids, a polymerase enzyme, 

suitable cofactors and conditions such as a suitable 
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teitperature and pH, the primer may be extended at its 3' 
terminus by the addition of nucleotides by the action of. 
a polymerase or similar activity to yield an primer 
extension product. The primer may vary in length 

5 depending on the particular conditions and requirement 
of the application. For example, in diagnostic 
applications, the oligonucleotide primer is typically 
:\ 15-25 or more nucleotides in length. The primer must be 
of sufficient complementarity to the desired template to 

10 . prime the synthesis of the desired extension product, 
that is, to be able anneal with the desired template . 
strand in a mcumer sufficient to provide the 3' hydroxyl 
, ( • • moiety of . the primer in appropriate jiixtaposition fof \ = 
use in the initiation of synthesis by a polymerase or : 

15 similar enzyme. It is not recjuired that the primer 

sequence represent an exact conplement of the desired 
template. For example, a non- complementary nucleotide 
sequence may be attached to the 5" end of an otherwise 
complementary primer. Alternatively, non-eomplementary 

20 bases may be interspersed within the oligonucleotide 

primer sequence, provided that the primer sequence has 
sufficient conplemeritarity with the sequence of the 
desired template strand to functionally provide a 
template-primer complex for the synthesis of the 

25 extension product. Amino acid residues described herein 
are preferred to be in the "L" isomeric form. However, 
residues in the "D" isomeric form may be substituted for 
any L-amino acid residue, provided the desired 
properties of the polypeptide are retained. 
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All amino-acid residue sequences represented herein 
conform to the conventional left-to-right amino- terminus 
to carboxy- terminus orientation. 

The term "isolated protein" or "isolated and 
5 purified protein" is sometimes used herein. This term . 
refers primarily to a protein produced by expression of 
an isolated nucleic acid molecule of the invention. 
TVltematively, this term may refer to a protein that has 
been sufficiently separated from other proteins with 
10 which it would naturally be associated, so as to exist ! 

in "substantially pure" form. "Isolated" is not meant to 
exclude artificial or synthetic mixtures with other . 
. compounds or materials, or the presence of impurities 

that do not interfere with the fundamental activity, and 
15 that may be present, for example, due to incoirqplete 

purification, addition of stabilizers, or compounding 
into, for exanple, immiuriogenic preparations or 
pharmaceutically acceptable preparations. 

The term "substantially pxire" refers to a 
20 preparation comprising at least 50-60% by weight of a 
given material (e.g., nucleic acid, oligonucleotide, 
protein, etc.). More preferably, the preparation 
comprises at least 75% by weight, and most preferably / 
90-95% by weight of the given compound. Purity is 
25 measured by methods appropriate for the given compound 

(e.g. chromatographic methods, agarose or polyacrylamide 
gel electrophoresis, HPLC analysis, and the like) . 

"Mature protein" or "mature polypeptide" shall mean 
a polypeptide possessing the sequence of the polypeptide 
30 after cuiy processing events that normally occur to the 
polypeptide during the course of its genesis, such as 
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protoelytic processing from a polyprotein precursor. In 
designating the sequence or boundaries of a mature 
protein, the first amino of the mature protein sequence 
is designated as amino acid residue 1. In the case of 

5 the mature NS5B protein, its normal biogenesis entails 
its proteolytic cleavage from a precursor polyprotein.. 
Thus, the first amino acid residue of the mature NS5B 
protein represents a serine residue for all known HCV 
sequences. As used herein, any amino acid residues 

10 associated with a matiire protein not naturally fotind 

associated with that protein that precedes amino acid 1 
are designated amino acid -1, -2, -3 and so on. For 
recombinant expression systems, a methionine initiator 
codon is often utilized for purposes of efficient 

15 tramslation. Thus, in the case of NS5B, a methionine 
codon may be place immediately proceeding the serine 
codon. This methionine residue in the resulting 
polypeptide, as used herein, would be positioned at -1 
relative to the mature NS5B protein sequence. 

20 The term "tag," "tag secjuence" or "protein tag" 

refers to a chemical moiety, either a nucleotide, 
oligonucleotide, polynucleotide or an amino acid, 
peptide or protein or other chemical, that when added to 
another sequence, provides additional utility or confers 

25 useful properties, particularly in the detection or 
isolation, to that sequence. Thus, for example, a 
homopolymer nucleic acid sequence or a nucleic acid 
sequence complementary to a capture oligonucleotide may 
be added to a primer or probe sec[uence to facilitate the 

30 subsequent isolation of an extension product or 
hybridized product- In the case of protein tags. 
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histidine residues (e.g., 4 to 8 consecutive histidine 
residues) may be added to either the amino- or 
carboxy- terminus of a protein to facilitate protein 
isolation by chelating metal chromatography. 
Alternatively, amino acid sequences, peptides, proteins 
or fusion partners representing epitopes or binding 
determinants reactive with specific antibody molecules 
or other molecules (e.g., flag epitope, c-myc epitope, 
transmembrane epitope of the influenza A virus 
hemaglutinin protein, protein A, cellulose binding 
domain, calmodulin binding protein, maltose binding 
protein, chit in binding domain, glutathione 
S- transferase, and the like) may be added to proteins to 
facilitate protein isolation by procedures such as 
affinity or immunoaf f inity chromatography. Chemical tag 
moieties include such molecules as biotin, which may be 
added to either nucleic acids or proteins and 
facilitates isolation or detection by interaction with 
avidin reagents, and the like. Numerous other tag 
moieties are known to, and can be envisioned by, the 
trained artisan, and are contemplated to be within the 
scope of this definition. 

As used herein, the terms "reporter, " "reporter 
system", "reporter gene," or "reporter gene product" 
shall mean an operative genetic system in. which a 
nucleic acid comprises a gene that encodes a product 
that when expressed produces a reporter signal that is a 
readily measurable, e.g., by biological assay, 
immunoassay, radioimmunoassay, or by colorimetric, 
fluorogenic, chemiluminescent or other methods. The 
nucleic acid may be either RNA or DNA, linear or 



21 



wo 99/51781 PCTAJS99y07404 

circiilar, single or double stranded, antisense or sense 
polarity, and is operatively linked to the necesseiry 
control elements for the expression of the reporter gene 
product. The required control elements will vary 
according to the nature of the reporter system and 
whether the reporter gene is in the form of DNA or RNA, 
but may include, but not be limited to, such elements as 
promoters, enhancers, trsuislational control sequences, 
poly A addition signals, transcriptional termination 
signals and the like. 

The terms "transform", "transfect", "transduce", 
shall refer to any method or means by which a nucleic 
acid is introduced into a cell or host organism and may 
be used interchangeably to convey the same meaning. 
Such methods include, but are not limited to, 
transfection, electroporation, microinjection, PEG- 
f us ion and the like. 

The introduced nucleic acid may or may not be 
integrated (covalently linked) into nucleic acid of the 
recipient cell or organism. In bacterial, yeast, plant 
and mammalian cells, for example, the introduced nucleic 
acid may be maintained as an episomal element or 
independent replicon such as a plasmid. Alternatively, 
the introduced nucleic acid may become integrated into 
the nucleic acid of the recipient cell or organism and 
be stably maintained in that cell or organism and 
further passed on or inherited to progeny cells or 
organisms of the recipient cell or organism. In other 
manners, the introduced nucleic acid may exist in the 
recipient cell or host organism only transiently. 
A "clone" or "clonal cell population" is a 
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population of cells derived from a single cell or coiiiinon 
ancestor by mitosis. 

A "cell line" is a clone of a primary cell or cell, 
population that is capcd^le of stable growth in vitro for 
many generations. 

An "immune response" signifies any reaction 
produced by cui antigen, such as a viral antigen, in. a 
host having a functioning immune system. Immune . / 
responses may be either humoral in nature/ that is, ; 
involve production of immunoglobulins or antibodies , or 
cellular in nature, involving various types pf B and T 
lymphocytes, dendritic cells, macrophages, antigen : 
presenting cells and the like, or both.' I^iiine 
responses may also involve the production or elaboration 
of various effector molecules such as cytokines, 
lymphokines and the like. Immune responses may be 
measured both in in vitro and in various cellular or 
animal systems. Such immune responses may be important 
in protecting the host from disease stnd may be used 
prophylactically and therapeutically. 

A "viral cuitigen" shall be any peptide, polypeptide 
or protein seq[uence, segment or epitope that is derived 
from a viirus that has the potential to cause a 
functioning immune system of a host to react to said 
viral antigen. 

An "antibody" or "antibody molecule" is any 
immunoglobulin, including antibodies and fragments 
thereof, that binds to a specific antigen. The term 
includes polyclonal, monoclonal, chimeric, and 
bispecific antibodies. As used herein, antibody or 
antibody molecule contemplates both an intact 
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inrntunoglobulin molecule and an immunologically active 
portion of an immunloglobulin molecule such as those 
portions knovm in the art as Fab, Fab*, F(ab')2 and 
F(v). 

5 As used herein, the term "living host" shall mean 

any non-human autonomous being. 

BRIEF DESCRIPTION OP THE DRAW31I6S J,;; 

Figure 1 shows an alignment of the amino acid 

10 sequences of several NS5B proteins of the invention. 

The approximate positions of conserved sequence motifs 
present in RdRp enzymes identified by Koonin [J Gen 
Virol. (1991) 72:2197-206] (Roman numerals) .and ^^^^ 
al. [EMBO J. (1989) 8:3867-74] (letters):', are indicated . 

15 above the clone 4 sequence. 

Figure 2 shows the purification of the HCV NS5B 
protein. Sf9 cells infected with a recombinant 
baculovirus expressing the HCV NS5B gene were harvested 

20 and lysed. Cleared lysate was subjected to a series of 
protein purification steps. Samples for each step of 
the purification procedure were electrophoresed on an 
SDS- containing polyacrylamide gel, which was then 
silver- stained. Lane 1: cleared cell lysate; lane 2, 

25 flow-through material from DEAE column; lane 3, material 
from heparin column; lane 4, material from Cibracron 
blue column. M = molecular mass standards; numbers in 
kilodaltons. 

30 Figure 3 is an alignment of the clone 4 NS5B 

sequence of the invention with the genotype la consensus 
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NS5B sequence derived from an infectious clone (GenBank 
Accession Number AF009606) and a genotype la NS5B 
sequence {PCT WO 97/12033) previously shown to have RdRp 
activity. The dashed lines indicate amino acid residue 
5 identity. 

Figure 4 shows an alignment of the clone 4 NS5B 
sequence of the invention with a genotype lb "consensus" 
amino acid sequence derived from sequences of HCV NS5B 
10 sequences obtained from the GenBank database and with 
genotype lb NS5B sequences previously shown to be 
functional (GenBank Accession Numbers M58335 and 
Z97730) . The dashed lines indicate amino acid residue 
identity. 

15 

Figure 5 shows a protein sequence alignment of the 
HCV genotype lb NS5B gene reported by Yanagi et al. 
[Virology (1998) 244:161-172; GenBank Accession Number 
AF0542471 and clone 4 of the present invention. The 
20 dashed lines indicate amino acid residue identity - 

Figure 6 shows results of a high throughput assay 
for HCV NS5B RdRp activity. 

25 Figure 7 shows the sequence of SEQ ID NO: 1. 

Figure 8 shows the sequence of SEQ ID NO : 3 . 

Figure 9 shows the sequence of SEQ ID NO : 4 . 

30 

Figure 10 shows the sequence of SEQ ID NO: 5. 
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Figure 11 shows the sequence of SEQ ID NO: 6. 



Figtire 12 shows the sequence of SEQ ID NO: 7. 

Figure 13 shows the amino acid sequence of clone 
20(V-1M), SEQ ID NO: 13. 

DETAILED DESCRIPTION OF THE INVENTION 

HCV is the major causative agent of transfusion- 
associated cuid sporadic non-A, non-B hepatitis- A high 
number of HCV- infected patients develop chronic 
hepatitis that eventually leads to cirrhosis and often 
progresses to hepatocellular carcinoma. There is an 
urgent need for new effective treatments of this 
disease. 

HCV is an enveloped virus with a (+) stranded 
linear RNA genome of approximately 9.4 kilobases. This 
nucleic acid encodes a laarge polyprotein that is 
processed by viral and cellular proteases into at least 
9 different viral polypeptides . 

In one of its aspects, the present invention 
provides HCV genetic material derived from an HCV- 
infected patient. The invention further provides HCV 
recombinant NS5B proteins expressed from these sequences 
and sequences that have RdRp activity. 

Expression of recombinant HCV NS5B gene sequences 
may be carried out in a variety of systems including but 
not limited to bacterial, yeast, mammalian, insect and 
plant cell systems, as well as in organisms such as 
infected, transfected, transduced or transgenic insects, 
animals or plants. In one embodiment of the invention, 
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reconibinant bacailoviruses were constructed to e^^jress 
HCV NS5B gene sequences in insect cells following 
infection in culture. 

RNA extracted from the serum of a patient infected 

5 with HCV was subjected to a reverse transcriptase-nested 
polymerase chain reaction proced\ire using primer 
oligonucleotides designed to anplify the NS5B gene 
sequences. Primers for the nested PGR reaction allowed 
direct cloning of the NS5B gene into a baculovirus 
10 ea^ression transfer vector, providing an initiator 

methonine codon immediately preceeding the first amino 
acid of the authentic NS5B coding sequence. The NS5B 
gene from several transfer vector clones was sequenced 
cuid used to generate recombinant baculoviruses . 

15 Infection of Sf9 cells with these viruses, followed by 
Western immunoblotting with HCV NS5B sequence-specific 
cuitisearum, showed clear production of the 68 kilodalton 
NS5B protein. 

In another aspect of the invention, NS5B sequences 

20 that are expressed, purified and evaluated for RdRp 

activity may possess varying levels of RdRp activity, 
from highly active to little or no activity, and may 
thus represent functional, poorly functional or 
non- functional sequences, respectively- Such sequences 

25 would also be assessed for activity in accordance with 
the present invention. 

In a further aspect of the invention, HCV NS5B 
proteins may be modified by particular changes in 
nucleotide and amino acid sequence that result in RdRp 

30 enzymes with altered functionality. Such changes may be 
subtle and represent conservative substitutions such as 
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in the case of nucleotide sequences, changes in the 
codon seG[uence that do or do not alter the encoded amino 
acid, or for amino acid sequences, changes that result 
in conservative residue substitutions, additions or 
delet ions . . 

In yet a further aspect of .the invention, NS5B 
nucleic acids and polypeptides .derived from sequences of 
the invention have utility in. numerous methods, assays 
and kits involving research, diagnostic, therapeutic and 
pharmaceutical applications, and in the development of 
antiviral strategies for the prevention and treatment of 
HCV disease. }-.\}: 

Based on the discoveries' hdescribed herein, it is 
demonstrated that simple knowledge of, or mere 
inspection of, a nucleotide or amino acid sequence of an 
NS5B gene or protein is insufficient to enable one to 
deduce or predict that the particular NS5B sequence 
represents a sequence that has enzymatic activity or 
functional utility. 

Preparation of HCV NS5B Nucleic Acid Molecules and HCV 
NS5B Proteins and Uses Thereof in Assay Methods and Kits 

A. Nucleic Acid Molecules 

Nucleic acid molecules encoding the HCV NS5B 
proteins of the invention may be prepared by two general 
methods: (1) They may be synthesized from appropriate, 
chemical starting materials, or (2) they may be isolated 
from biological sources. Both methods utilize protocols 
well known in the art. 

The availability of nucleotide sequence 
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information, such as that provided herein for HCV NS5B 
sequences, enables preparation of an isolated nucleic 
acid molecule of the invention by oligonucleotide 
synthesis. Synthetic oligonucleotides may be prepared 
5 by the phosphoramadite method eTr?>loyed in the Applied 

Biosystems 38A DNA Synthesizer or similar devices. The 
resultant construct may be purified according to methods 
known in the art, such as high performance liquid 
chromatography (HPLC) . Long, double- stranded 
10 polynucleotides, such as a DNA molecule of the present 

invention, must be synthesized in stages due to the size 
limitations inherent in current oligonucleotide 
synthetic methods. Thus, for example, a 3 kilobase . 
double -stranded molecule may be synthesized as several 
15 smaller segments of appropriate complementarity. 

Complementary segments thus produced may be ligated such 
that each segment possesses appropriate cohesive termini 
for attachment of an adjacent segment. Adjacent 
segments may be ligated by annealing cohesive termini in 
20 the presence of DNA ligase to construct an entire 3 
kilobase double -stranded molecule. A synthetic DNA 
molecule so constructed may then be cloned and amplified 
in an appropriate vector. 

Nucleic acid sequences encoding HCV NS5B proteins 
25 may be isolated from appropriate biological sources 
using methods known in the art. For example, RNA 
isolated from the serum of an HCV infected patient may 
be used as a suitable starting material for the 
generation of cDNA molecules encoding HCV NS5B proteins. 
30 In accordance with the present invention, nucleic 

acids having the appropriate level of sequence homology 
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with the protein coding region of the DNA molecules of 
the present invention may be identified by using 
hybridization and washing conditions of appropriate 
stringency. For example, hybridizations may be 
5 performed, using a hybridization solution comprising, 
for example, 5X SSC, 5X Denhardt's reagent, 1.0% SDS, 
100 A*g/inl denat\ired, fragmented salmon sperm DNA, 0.05% 
sodium pyrophosphate and up to 50% formamide. 
Hybridization is carried out at 37-42°C for at least six 

10 hours. Following hybridization, filters are washed as 

follows: (1) 5 minutes at room tenperature in 2X SSC cuid 
1% SDS; (2) 15 minutes at room temperature in 2X SSC and 
0.1% SDS; (3) 30 minutes-1 hour at 37°C in IX SSC and 1% 
SDS; (4) 2 hours at 42-65*'C in IX SSC and 1% SDS, 

15 changing the solution every 30 minutes. 

One common formula for calculating the stringency 
conditions required to achieve hybridization between 
nucleic acid molecules of a specified sequence homology 
is as follows (Sambrook et al., 1989) : 

20 

= 81-5"C + 16.6Iiog lNa+] + 0.41 (% G+C) - 0.63 (% formamide) - 
600/#bp in duplex 

As an illustration of the above formula, using [Na+] = 
25 [0.3681 and 50% formamide, with GC content of 42% and an 
average probe size of 200 bases, the T^, is 51°C. The 
of a DNA duplex decreases by 1-1. 5*^0 with every 1% 
decrease in homology. Thus, targets with greater than 
about 75% sequence identity would be observed using a 
30 hybridization temperature of 42 °C. Such a sequence 
would be considered substcoitially homologous to the 
sequences of the present invention. 
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Nucleic acids of the invention may be maintained as 
DNA in any convenient cloning vector. In one 
embodiment, clones are maintained in plasmid 
cloning/expression vectors, such as pBluescript plasmids 
5 (Stratagene, La Jolla, CA) or recombinant baculovirus 
transfer vectors such as pFastBac vectors (Gibco-BRL, . 
Gaithersburg, MD) that are propagated in suitable E. 
coli host cells - 

The nucleic acids of the invention may also be used 

10 as starting materials for the generation of sequence 

variants of the nucleic acids of the invention using any 
number of synthetic and molecular biologic procedures 
well known in the art including but not limited to 
site-directed mutagenesis techniques. Particular 

15 mutations may give rise to HCV NS5B proteins with 

altered characteristics such as increased enzymatic 
activity. 

HCV NS5B protein-encoding nucleic acid molecules of 
the invention include cDNA, genomic DNA, RNA, and 

20 fragments thereof, which may be single- or 

double -straurided in nature. Thus, this invention 
provides oligonucleotides (sense or antisense strands of 
DNA or RNA) having sequences capable of hybridizing with 
at least one sequence of a nucleic acid molecule of the 

25 present invention, such as selected segments of the cDNA 
having substantially the sequence of any of the 
sequences identified in the present invention. Such 
oligonucleotides are further useful as probes and 
primers for detecting or isolating additional HCV NS5B 

30 encoding nucleic acids. 
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B. Proteins 

HCV NS5B proteins of the present invention may be 
prepared in a variety of ways, according to amy niimber 
of known methods. The protein may be purified from 
5 appropriate sources, e.g., cultured cells, tissues or 
organs, by a variety of techniques that may include 
partitioning and precipitation procedures, affinity 
purification methods, conventional chromatography 
procedures, high performance chromatography techniques 

10 and the like. 

The availability of nucleic acids molecules 
encoding HCV NS5B protein enables production of the. 
protein using in vitro expression methods known in the 
art- For example, a cDNA or gene may be cloned into an 

15 appropriate in vitro transcription vector, such as pSP64 
or pSP65 for in vitro RNA synthesis, followed by 
cell -free translation of the RNA in a suitable cell -free 
translation system, such as extracts of wheat germ, 
rabbit reticulocytes or HeLa cells. In vitro 

20 transcription and translation systems are commercially 
available (e.g., Promega Biotech, Madison, WI; 
Gibco-BRL, Gaithersburg, MD) . 

Alternatively, according to a preferred embodiment 
of the invention^ larger quantities of HCV NS5B protein 

25 may be produced by expression in suitable prokaryotic or 
eukaryotic systems such as bacterial, fungal, mammalian 
or plant systems. For example, part or all of a DNA 
molecule, such as a cDNA may be inserted into a plasmid 
vector adapted for expression in a bacterial cell, such 

30 as E. coll, or into a baculovirus vector for expression 
in an insect cell. Such vectors comprise the regulatory 
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elements necessary for expression of the DNA in the host 
cell (e.g., E. coli or insect cell), positioned in such 
a manner as to permit expression of the DNA in the host 
cell. Such regulatory elements required for expression 

5 may include promoter sequences, transcriptional 
initiation and termination sequences, enhancer 
sequences, translational control sequences cuid the like. 

The HCV NS5B proteins or derivatives thereof 
produced by gene expression in a recombinant prokaryotic 

10 or eukaryotic system may be purified according to 
methods known in the art. In one embodiment, a 
commercially available expression/secretion system can 
be used, whereby the recombinant protein is expressed 
and thereafter secreted from the host cell, to be easily 

15 purified from the surroiinding medium. If 

expression/secretion vectors are not used, an 
alternative approach involves purifying the recombinant 
protein from extracts of expressing cells, tissues or 
organs by standard protein purification techniques or by 

20 affinity separation techniques, such as by immunological 
interaction with antibodies that bind specifically to 
the recorabinsmt protein or by nickel columns for 
isolation of recombinant proteins tagged with 5-8 
histidine residues at their N-terminus or C-terminus. 

25 Such methods are commonly used by skilled practitioners . 

The HCV NS5B proteins of the invention, prepared by 
the aforementioned methods, may be analyzed according to 
standard procedures. For example, such proteins may be 
subjected to electrophoretic analyses and to amino acid 

30 sequence analyses, as well as to crystal lographic 

analyses for structure determination according to known 
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methods. Such analyses provide useful information 
regarding . the functionality of the NS5B protein and on 
means to affect that functionality, such as in the 
design of molecules that may inhibit the function of the 
5 NS5B protein. 

C. Assay Methods and Kits 

10 The HCV NS5B sequences of the invention may be used 

in a variety of ways having utility in research, 
diagnostic, therapeutic and pharmaceutical applications. 
Representative methods of use for the conposit ions of 
the invention are described below. . ^ ; \ • 

15 In one aspect, the nucleic acid sequences of the 

invention, and sequences complementary to these, may be 
used as probes or primers for the detection, labeling, 
identification or isolation of related nucleic acids in 
biological or synthetic preparations. For example, 

20 nucleic acid sequences of the invention may be used as 
hybridization probes to detect the presence of HCV in 
samples. Such hybridization probes may further be used 
to isolate the nucleic acids to which they are 
hybridized by techniques well known in the art. 

25 Additionally, the nucleic acid sequences of the 

invention may be used as primers for the detection or 
isolation of HCV or related nucleic acids using 
techniques such as reverse transcriptase -polymerase 
chain reaction (RT-PCR) . Appropriate primers pairs may 

30 be further used in nested PCR applications. Such 
primers, primer pairs and probes may represent any 
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portion of the NS5B sequences of the invention. The 
actual sequence of the NS5B gene used will vary 
according to the specific application. Moreover, 
additional sequences may be. added to the HCV primer or 
5 probe sequence, such as homopolymer tails (tags) , 
sequences that, represent useful restriction enzyme 
recognition sites, sequences encoding particular amino 
acid residues, initiation or termination codons or other 
sequences that may be useful for the particular 

10 application at hand. Typically, oligonucleotides of 

from 10 to 80 nucleotides in length that are either the 
same as or complementary to the sequences of the 
invention are useful as hybridization probes or as 
primers in RT-PCR applications. Alternatively, the 

15 entire NS5B sequence may be employed as a capture 
hybridization probe, for exanple. 

Several examples of useful oligonucleotide primers 
and probes of the invention comprise sequences set forth 
in Tsible 1, secjuences complementary to these and potions 

20 of these sequences. 

Table 1 

Oligonucleotides useful as probes and prijners 

5'-TCAATGTCCTACACATGGAC-3» SEQ ID NO: 14 

5»-CTACACATGG-3' SEQ ID NO: 15 

25 5'-CTCTGATTACACCATGaKnXK:GGAGGAGAGCAAGCTGCCC-3' SEQ ID NO: 16 

5'-AATGCGCTGAGCAACTCTTTGCTGCGCC-3' SEQ ID NO: 17 
5' -CCATAACATGGTCTATGCCACAACATCCCGCAGCGCAAGCCAC3CGGC-3' SEQ ID NO: 18 

5« -GAAGAAGGTAACTTTTGACAGG-3' SEQ ID NO: 19 

5«-CAAGTCCTGGATGACCACTACCG-3» SEQ ID NO: 20 

30 5*-GA06TGCTCAAGGA-3' SEQ ID NO: 21 

5'-ATGAAGGCGAAGGCGTCC-3' SEQ ID NO: 22 

5'-GGAAGAAGCCTGTAAG-3' SEQ ID NO: 23 

5 • -GAACCTATCCAGCAAGGCCGTTAA-3 • SEQ ID NO: 24 
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5' 


-ACCAGA6AAAGGAGGCCGC-3 * 


SBQ 


ID 


NO: 


25 


5' 


-ACCCAGACTTGGGG-3 ' 


SBQ 


ID 


HO: 


26 


5' 


-TCTCCACCCTTCCTCAGGCT-3 • 


SBQ 


ID 


MO: 


27 


5' 


-CGAGTTCCTGGTGAATGCC-3 ' 


SBQ 


ID 


NO: 


28 


5' 


-TGCCCTATGGGCTTCGC31TATGAC-3 ' 


SBQ 


ID 


NO: 


29 


5« 


-TTTCGACTOWVCGGTaiCCXSAGJ^^ ' 


SBQ 


ID 


NO: 


30 


5» 


-GTTGAGGAGTCAATr-3 » 


SEQ 


ID 


NO: 


31 


5' 


-TTGGCCCCCX3AAGCCAGACA-3 ' 


SEQ 


ID 


NO: 


32 


5' 


- AAGGTCGCTTACAGAGC - 3 ' 


SEQ 


ID 


NO: 


33 


5' 


-ATCGGGGGTCCCCTGAC-3 ' 


SBQ 


ID 


NO: 


34 


5' 


-TAACTCAAAA6GGCAGA6-3 • 


SBQ 


ID 


NO: 


35 


5' 


-ATGTTACrrGAAGGCCTCT-3 • 


SEQ 


ID 


NO: 


36 


5 


-GATGCTTGTGTGCX5GAGACGACCTC-3 ' 


SBQ 


ID 


NO: 


37 


5 


-GGTCGCGCACGATGCATCTGGCAAAAC3GGTA-3 ' 


SEQ 


ID 


NO: 


38 


5 


-CACCACCCCTCTrGCGCGG-3 • 


SEQ 


ID 


NO: 


39 


5 


-CTCCATCCTTCTAGCTCAGGAGCAA-3 ' 


SEQ 


ID 


NO: 


40 


5 


' -AGTTACTGTCCCAGGGGGGG-3 ' 


SEQ 


ID 


NO: 


41 


5 


• -TCCGGCTGCGTCCCAGT-3 • 


SEQ 


ID 


NO: 


42 


5 


• -CCGACCCCX3CTGGTTCATGTGGTGCC-3 ' 


SEQ 


ID 


NO: 


43 


5 


' -CTACXrrGCTCCCGAACCGA-3 • 


SBQ 


ID 


NO: 


44 


5 


• -CTACCTGCTCCCCAACCGA-3 • 


SEQ 


ID 


NO: 


45 



Additionally, the nucleic acid sequences of the 
invention may be used as primers for the generation of 
variants or mutants of the sequences of the invention 
using a variety of methodologies known in the art, 
including site-directed mutagenesis procedures. 

In another aspect, the nucleic acid sequences of 
the invention may be used in the construction or 
generation of, or incorporated into, infectious viruses, 
vectors or replicons. Provision or substitution of the 
functionally superior NS5B sequences of the invention 
for poorly functional or non- functional counterparts 
will serve to improve the infectious and replicative 
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characteristics of the resulting virus or replicating 
unit. For exanple. Rice et al. (PCT WO 98/39031) 
provide an infectious nucleic acid of HCV. The NS5B 
polymerase encoded by this infectious clone is shown in 

5 the present invention to be sxibstantially inferior in 
its RdRp activity to those of the present invention. 
Therefore, substitution of the NS5B encoding gene of 
Rice et al. with those of the present invention would be 
expected to result in improved virus replication due to 
10 the in?>roved characteristics of the NS5B gene and 
encoded protein of the present invention. Such 
substitutions may be carried out by standard genetic 
engineering procedures well known in the art. The 
resulting infectious nucleic acid would have 

15 consideredDle advantages over current infectious clones 
of HCV, including, but not limited to, inproved or 
higher level viral RNA synthesis, improved levels of 
infectious virus production cuid improved virus 
replication in living hosts and in in vitro systems. 

20 Use in other systems in which the function of the HCV 
NS5B polymerase is inportcuit, such as in complementing 
or trans -complementing systems, replicon systems, 
defective viruses, defective interfering particles and 
the like, would also benefit from the use of the nucleic 

25 acid sequences of the invention. 

In another embodiment, the nucleic acid sequences 
of the invention may be used in methods to elicit immune 
responses to the HCV NS5B protein. For example, the 
NS5B- encoding nucleic acid sequences of the invention 

30 operationally linked to an expression operon may 
introduced directly into cells, particularly into 
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antigen presenting cells such as dendritic cells, of a 
living host or hximcin possessing a fxinctioning itranune 
system. Introduction of the sequences may utilize 
transf ection, transformation, or tramsduction methods, 
5 or involve the physical uptake of particles coated with 
the NS5B nucleic acid sequences, such as plasmid- coated 
gold particles. Once inside cells, the NS5B seqpiences 
are expressed, processed and presented to the host's 
immune system. Such methods are useful in the 
10 elicitation of humoral and cellular immune responses in 
a living host euid in humans and in vaccines for HCV. 

The nucleic acid sequences of the invention may be 
further used in the generation of cell lines or cellular 
systems that express the HCV NS5B protein. Such cell 
15 lines in which a functional NS5B protein is expressed 
from the NS5B genes of the invention will have utility 
in methods for assaying materials for antagonistic or 
agonistic activity toward HCV. For example, assays may 
be established whereby intact cells expressing an HCV 
20 NS5B protein of the invention are contacted with agents 
or materials suspected of affecting the intracellular 
activity of the HCV NS5B protein, and the affect of such 
agents on the HCV NS5B activity is measured. The affect 
of such agents on the HCV NS5B activity may be measured 
25 in any nundber of ways. For example, RNA synthesis that 
is directly or indirectly dependent on the HCV NS5B 
polymerase activity may be qucuitif ied. In one msuiner, 
the amoxint of a radiolabeled precursor of RNA (e.g., ^H- 
uridine) that is incorporated into trichloroacetic acid- 
30 precipitable RNA that is dependent on NS5B activity may. 
be measured. 
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Alternatively, such cell systems may utilize a 
reporter system in which the production of the reporter 
signal is dependent on RNA synthesis by the HCV NS5B 
polymerase. In one embodiment, a RNA substrate of the 

5 HCV NS5B polymerase is provided that is the antisense 
strsuxd of an mRNA, the sense strand (mRNA) of which is 
effectively translated to produce a polypeptide capable 
of being detected or of producing a detectable signal 
(the reporter) . For exanple, an RNA molecule is 

10 provided comprising the sequence complementary to the 
coding sequence of lucif erase (cuitisense strauid) . The 
activity of the HCV NS5B polymerase on this RNA results j 
in the production of the sense strand of the luc i f erase .; 
. gene, which is then translated by the cellular 

15 trcuislational system to produce lucif erase protein. The 
luciferase protein then may be detected by antibodies to 
the luciferase protein or by measurement of luciferase 
enzymatic activity in intact cells or in cellular 
extracts using a luminometer or other similar device. 

20 Numerous other reporters may serve equally well in this 
application including but not limited to, p- 
galactosidase, alkaline phosphatase, fluorescent green 
protein and the like. Furthermore, the cell systems 
that may be used in this method of the invention may be 

25 of bacterial, fungal, insect, avian, mammalian or plant 
origin. 

Further, the nucleic acid sequences of the 
invention may be used in assays to identify agents or 
materials capable of interacting with or affecting the 
30 HCV NS5B nucleic acid secjuences. For example, assays 



39 



wo 99/51781 PCTAJS99/07404 

may be established in which nucleic acid sequences of 
the invention are provided and then contacted with 
agents or materials suspected of interacting with such 
sequences. Agents identified in such interaction assays . 
5 would then have potential diagnostic utility and uses in 
the detection of HCV in, for exanqple, biological 
sanples. Such agents would also have potential utility . 
in applications involving the prevention or treatment of 
HCV disease in am affected living host, including 

10 humans, and for the inhibition or enhancement of HCV 
replication or propagation in living hosts and in in 
; vitro systems such as cell, tissue cuid organ cultures. ; •; 
Additional applications may be envisioned once the , 
nature of the particular agent is clear. 

15 The HCV NS5B protein conpositions of the invention 

also have broad utility. In diagnostic applications, 
for example, the NS5B proteins, or peptides thereof, may 
be used in assays for the detection of immune responses 
to the same. For example, protein sequences or peptides 

20 of the invention may be used in assays in which said 
sequences are immobilized on a matrix and used to 
capture antibodies directed to said sequences. 
Additionally, protein sequences or peptides of the 
invention may be used to detect or measure cell -mediated 

25 immune responses to the protein, such as in immune cell 
proliferation assays. 

The HCV NS5B protein compositions of the invention 
also have potential utility in the elicitation of immune 
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responses, such as in vaccines. For exarrple, provision 
of the NS5B proteins of the invention, or peptides 
thereof, to a living organism with a fimctioning iiramine 
system will cause such organism to mount an immune 
5 response to the NS5B sequences. The NS5B sequences may 
be presented to the living organism in any number of 
assays well known to those trained in the art and 
include, but are not limited to, providing free protein 
or peptides, formulated protein or peptides, adjuvemted 

10 protein or peptides, protein or peptides in the context 
of intact or disrupted cells in which NS5B sequences are 
present and other such manners. Immune responses so 
elicited may be either humoral or cellular in nature, or 
both. Such immune responses may be important in 

15 protecting living hosts from HCV disease and may also 
provide or serve as a source of useful immunological 
reagents such as antibodies, that may further have 
therapeutic or diagnostic utility. The NS5B proteins of 
the invention, or peptide portions thereof, may further 

20 be used to select or purify such antibodies to NS5B. 

For exaniple, NS5B protein may be immobilized and used to 
bind antibodies specific to the NS5B protein, and thus 
enrich for such antibodies. Furthermore, the NS5B 
proteins and peptides of the invention may be used to 

25 produce monoclonal antibodies to the NS5B protein using 
standard techniques known in the art. Antibodies to the 
NS5B protein, whether polyclonal or monoclonal, may be 
further evaluated for their ability to affect the 
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enzymatic activity of the NS5B RdRp activity. 

Several examples of portions of the NS5B protein 
sequences useful in the above applications conprise 
sequences or portions thereof, provided in Table 2. 

Table 2 

Peptides Useful in Methods of the Invention 

SMSYTWTGALITPCAAEESKLPINALSNSLIjIfflHlJMVYACT 
10 KVTFDRIiQVLDDHYRDVIiKEMKAKASTVKA SEQ ID NO: 46 

SVBKACKLTPPHSARSKFGYGAKDVRNLSSKAViraiHSVWKDIiro SEQ ID NO: 47 

ETPIDTTIMAKNEVFCVQPEKGGRKPARLIVYPDLGVRVC SBQ ID NO: 48 

15 SSYGFQYSPGQRVEFLVMAWKSKKCPMGFAYDTRCFDSTVTBNDIRVEESIY 

QCCDIAPEARQAIRSIiTERLYIGGPLTNSKGQSCGYRRCRASGVLTTSCG SBQ ID NO: 49 

CTMLVCGDDLVVXCBSAGTQEDAASLRVFTEANTRYSAPPGDPPQPEYDLBLITSC 

SBQ ID NO: 50 

20 SVMDASGKRVYYIiTRDPTTPIARAAWETARHTPVNS SEQ ID NO: 51 

CLRKLGVPPLRVWRHRARSVRAKLIiSQGG SBQ ID NO: 52 

FVAGYSGGDIYHSLSRARPRWFMWC SBQ ID NO: 53 



25 The protein compositions of the invention have 

utility in assays for the detection and identification 
of agents capable of interacting with or affecting the 
HCV NS5B protein. Assays may be established in which 
HCV NS5B polypeptide sequences of the invention are 

30 provided and then contacted with agents or materials 
suspected of interacting with such sequences. For 
example, upon provision of the HCV NS5B protein of the 
invention, or fragment or portion thereof, contacted 
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agents may be assessed for their ability to bind 
specifically to the protein. Such binding agents would 
then have potential diagnostic utility and uses in the 
detection of HCV in, for example, biological sainples. 
5 Such binding agents may further affect the fxinctional 
activity of the HCV protein, such as either inhibiting 
or enhancing the HCV NS5B function. Agents that inhibit 
the function of the NS5B protein would have potential 
utility in applications involving the prevention or 
10 treatment of HCV disease in an affected living host or 
for the inhibition of HCV replication or propagation in 
living hosts, including hiamans, in in vitro systems such 
as cell, tissue and organ cultures or in biological 
materials. Agents that enhance the function of the NS5B 
15 protein would have potential utility in applications 

involving the replication, propagation or production of 
HCV in living hosts, such as in animal models of HCV 
replication, and in in vitro systems such as cell, 
tissue and organ cultures. 
20 In another embodiment, methods of assay are 

provided in which the HCV NS5B polymerase activity 
furnished by an enzymatically active NS5B protein of the 
invention is measured directly. Agents placed in 
contact with said enzymatically active NS5B polymerase 
25 may be assessed for their ability to specifically affect 
this enzymatic activity. Such enzymatically active 
polymerase may be provided in an extract or lysate of a 
cell in which the polypeptide was produced, in an in 
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vitro cell -free expression system or in an enriched or 
purified form. 

There are numerous means by which the 
enzymatic activity of the HCV NS5B protein provided in 
5 an extract, cell-free system or enriched form may be 

assessed, and these are well known in the art. NS5B- ' 
dependent RNA synthesis typically requires certain . 
reaction components including minimally a buffered 
medium, a divalent cation, precursors of RKA (nucleoside 

10 triphosphates, NTPs) , an RNA template and a primer for , 
RNA synthesis on that template. Additional components 
may include monovalent cations, reducing agents, ' '-'S/zh^l 
stabilizers, cof actors and inhibitors of activities / • J- 
unrelated to the NS5B RdRp activity such as inhibitors 

15 of RNase, phosphatase, kinase, phosphotransferase and 
similar activities. 

Measurement of NS5B-dependent RNA synthesis may be 
assessed in numerous maumers. In one example, the 
incorporation of a precursor of RNA into a polymer of 

20 RNA is measured, such as the incorporation of a 

radiolabeled NTP into trichloroacetic acid-precipitable 
RNA, which may then be quantified by scintillation 
spectrometry or phosphorimaging technologies. Such 
precursors may alternatively be tagged with other 

25 moieties to allow their ready detection such as with 
biotin for detection with avidin reagents including 
various avidin conjugates such alkaline phosphatase and 
the like or with fluorescently- labeled NTPs for 
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detection using fluorescent technologies such as 
fluorescence polarization. 

NS5B-dependent RNA synthesis may also be assessed 
by measxiring the extension of a pre- labeled or tagged 
5 primer of RNA synthesis such as a radiolabeled or 
biotin- tagged. oligonucleotide that is used by the 
polymerase to initiate RNA synthesis on a template RNA 
moleciile. Extension of the primer may be assessed by 
quantifying the addition of nucleoside triphosphates to 

10 the primer, by determining the length of the primer 
product, or by other methods known in the art. 

Altematively, the product of NS5B RjdRp activity ' . 
may be detected and quantified by/capture of the product 
RNA using hybridization techniques. For example, an *. 

15 oligonucleotide complementary to the product of the NS5B 
RdRp reaction may be introduced during or after the 
reaction and hybridized to the product. The extent of 
hybridization of the added oligonucleotide may be used 
as a measure of the amount of product RNA present in the 

20 mixture and may be assessed by various means known in 
the art. 

Other means of detection of the products of NS5B 
RdRp activity are readily known to, or can be envisioned 
by, the skilled artisan and are fully contemplated here. 

25 Assays involving the nucleic acid and polypeptide 

compositions of the invention may be formatted in any 
number of configurations. Particularly useful for 
evaluating large numbers of agents and materials are 
high throughput screening formats. Traditionally such 

30 assays were typically formatted in 96 well plates. 

However, 384, 864 and 1536 well plates may be used in 
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such high throughput assay systems. These systems are 
often automated using robotics technologies to allow 
manipulation cind processing of large numbers of samples. 
The agents or materials that may be evaluated in 
5 the various assay methods of the invention for potential 
antagonistic or agonistic affects include but are not 
limited to small molecules, polymers, peptides, 
polypeptides, proteins, immunoglobulins or fragments 
thereof, oligonucleotides, antisense molecules, peptide- 
10 nucleic acid conjugates, ribozymes, polynucleotides and 
the like. The potential utility of agents or materials 
identified using the compositions and assay methods of 
the invention will be broad and will include uses for 
the detection emd isolation of HCV nucleic acids and 
15 polypeptides, for the detection or diagnosis of HCV, for 
the prevention cuid treatment of HCV disease in an 
affected living host, including humans, and for the 
inhibition or enhcuxcement of HCV replication or 
propagation in living hosts and in in vitro systems such 
20 as cell, tissue and organ cultures, as well as for other 
uses the may be envisioned once the nature of the agent 
is clear. 

Another feature of the invention includes kits to 
facilitate the use of the compositions and methods 

25 disclosed herein. Exemplary kits would include HCV NS5B 
nucleic acids and polypeptides of the invention, and/or 
variants thereof, alone or in suitable vectors. Also 
included would be protocols for use of the compositions 
of the invention for the particular application and the 

30 necessary reagents to carry out the application. Such 
reagents may include, but not be limited to, buffers, 
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solvents, media and solutions, substrates and cof actors, 
vectors and host cells, eoid detection or reporter 
reagents. Accessory items may include vials, vessels, 
reaction chambers and instruction sheets. 

The following examples are provided to describe the 
invention in further detail. These examples, which set 
forth the preferred mode presently contemplated for 
carrying out the invention, are intended to illustrate 
and not to limit the invention. 

EXAMPLE 1 

Cloning and Expression of HCV NS5B Genes 

RNA isolated from the serum of an HCV- infected 
patient was used to amplify the HCV NS5B gene using an 
RT-nested PCR protocol. Using the following first roxmd 
primers: 5'-TGA GGA TGT CGT CTG CTG CTC AAT GTC C-3' 
and 5'-GGG ATG GCC TAT TGG CCT GGA GT-3 ' , the RT 
reaction was performed at 50**C for 50 minutes followed 
by a 1 minute 94**C denaturation, followed by 40 cycles 
of first round PCR, 15 cycles of 94 ""C for 30 seconds, 
50°C for 30 seconds 68°C for 2 minutes and 25 cycles of 
94**C for 30 seconds, 50**C for 30 seconds, eS^'C for 2 
minutes . 

A portion of first round PCR reaction mixture was 
then used in the second roimd PCR reaction in which the 
following nested primers were incorporated: 5'-AAC AGA 
• TCT GAA TTC TTA TAA ATA TGT CAA TGT CCT ACA CAT GGA C-3' 
and 5 ' -TGC TCT AGA GCG GCC GCT CAT CAT CGG TTG GGG AGC 
AGG TAG- 3 ' , which included EcoRI and Not I restriction 
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sites, respectively (xinderlined) . f or subsequent cloning 
purposes. The second round PGR involved an initial 
denaturation at 94 °C for 1 minute /.followed by 10 cycles 
of 94''C for 45 seconds, 50°C for 30 seconds, 68°C for 2 
5 minutes and 20 cycles of 94 °C for, 30 seconds, 50°C for 
30 seconds, 68 "^C for 2 minutes, -'i^- J ^ 

The resultant PGR product was purified, digested 
with EcoRI £uid NotI emd ligated to pFastBac plasmid 
(Gibco-BRL) previously digested With EcoRI emd Notl. 

10 The ligation mixture was transformed into DH5 B- coli 

cells. Plasmids from bacterial colonies containing the 
HCV NS5B gene were used to generate a recombinant 
baculovirus by the transposition'' method of the 
Bac-to-Bac system (Gibco-BRL) • '/After transformation of 

15 the HCV NS5B gene -containing pPastBac plasmid DNA into 
coli DHlOBac cells, several colonies containing 
bacmid DNA were transfected into Sf 9 insect cells 
according to the protocol supplied by the manufacturer. 
NS5B protein expression in recombinant 

20 baculovirus -infected cells was verified by Western 

iraraunoblot analysis with antiseirum specific to the HCV 
NS5B sequences. 

Six independently isolated clones (clones 4, 14, 
21, 11, 16 and 20) of the HCV NS5B gene from a single 

25 RNA preparation derived from one patient were sequenced. 
The nucleotide sequences of these clones are provided by 
SEQ ID NO: 1, 3, 4, 5, 6 and 7, respectively. All six 
sequences were derived from a genotype lb virus and are 
closely related, but not identical, to one another. 

30 Each of the sequences set forth above is unique and 
novel sequence, not presently listed in the GenBank 
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database. The deduced amino acid sequences of 
nucleotide sequences identified by SEQ ID NO: 1, 3, 4, 
5, 6 and 7 are provided by SEQ ID NO: 2, 8, 9, 10, 11 
and 12, respectively. These NS5B amino acid sequences 
5 are aligned relative to clone 4 (SEQ ID NO: 2) in Figure 
1- As revealed in this alignment, the sequence of the 
NS5B protein of clones 14, 21, 11, 16 and 20 differs 
from that of clone 4 by 2, 5, 3, 4 and 3 amino acids, 
respect ively . 



EXAMPLE 2 ■ 
Purification of HCV NS5B Proteins / 

15 

HCV NS5B proteins may be obtained in purified form 
from recombinant protein-eacpressing cell systems by any 
number of procedures known to the trained artisan, 
several of which are exemplified in Current Protocols in 

20 Molecular Biology, Frederick M. Ausubel et al. eds., 
John Wiley & Sons, 1995 which is incorporated by 
reference herein. 

In the case of the present example, baculovirus 
clone 4 -infected insect cells were disrupted in lysis 

25 buffer (50% glycerol, 20 mM Tris-HCl, pH 7.5, 10 mM 

dithiothreitol (DTT) , 0.5 M NaCl, 1 mM EDTA, 2% Triton 
X-100) supplemented with a Complete Protease Inhibitor 
tablet (Boehringer Mannheim) . MgClj was then added to a 
final concentration of 10 mM, followed by 10 units DNase 

30 I (RQ-1, Promega) . After 30 minutes on ice, the lysate 
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was clarified by cent rifugat ion at 35,000 rpm for 35 
minutes at 4**C. 

The clarified lysate was diluted to a final NaCl 
concentration of 0.3 M with elution buffer (20% 
5 glycerol, 20 itiM Tris-HCl, pH 7.5, 10 ttiM DTT, 1 niM EDTA, 
0.5% Triton X-100) , and incubated with DEAE Sepharose 
ecpiilibrated in elution buffer containing 0.3 M NaCl at 
^'^C. The mixture was then poured into a column, and 
flow- through material was collected. Flow- through 

10 material was diluted to a final NaCl concentration of 
0.2 M with elution buffer and loaded onto a heparin 
Sepharose column equilibrated in elution buffer 
containing 0.2 M NaCl. Boimd proteins were eluted with 
a linear gradient of NaCl (200 mM to 1 M NaCl in elution 

15 buffer) while fractions were collected. 

NS5B- containing fractions were pooled and loaded 
onto a Cibacron Blue column equilibrated in elution 
buffer containing 0.4 M NaCl. Bound proteins were 
eluted with a linear gradient of NaCl (400 mM to 4M NaCl 

20 in elution buffer) while fractions were collected. 
NS5B-containing fractions were pooled and dialyzed 
against 50% glycerol, 10 mM Tris-HCl, pH 7.2, 50 mM 
NaCl, 1 mM EDTA, 0.01% Triton X-100 and stored at -20*'C, 
Typical purification results are shown in Figure 2. 

25 

EXAMPLE 3 

Enzymatic Activity of Purified HCV NS5B Proteins 

30 There are numerous methodologies for the 

measurement of RNA-dependent RNA polymerase activity 
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that are well known to one of ordinary skill in the art. 

One approach for measuring the HCV NS5B RdRp 
activity uses a purified recoinbinant NS5B protein in an 
in vitro RdRp assay. For exatiple, Behrens et al. [EMBO 
5 J. 15:12-22 (1996)] describe the baculovirus expression, 
purification and enzymatic activity of the HCV NS5B RdRp 
derived from the BK strain of HCV. In another example, 
PCT WO 97/12033 [PCT/US96/15571] discloses the bacterial 
expression, purification and enzymatic activity of the 

10 HCV NS5B RdRp derived from an HCV sample obtained from 
the Centers for Disease Control and several truncated 
and modified versions of this sequence. Another example 
is Lohmann et al. [J Virol 71:8416-8428 (1997)1, in 
which the NS5B gene derived from a chronically infected 

15 patient was expressed with recombinant baculoviruses in 
insect cells, purified and enzymatically evaluated. Yet 
another example is Yucui et al. [Biochem Biophys Res Comm 
232:231-235 (1997)], in which the bacterial expression, 
purification and enzymatic activity of the HCV NS5B RdRp 

20 derived from a patient with chronic sporadic hepatitis 
is described. 

Purified NS5B protein prepared according to Example 
2 was incubated in a 10 /zL standard reaction mixture 
consisting of 20 iriM HEPES, pH 7.5, 3 mM MgCla, 1 mM 
25 dithiothreitol, 400U/mL RNasin (Gibco/BRL) , 0.5 iriM each 
of OTP, ATP and CTP, 0.1 fM P^PlGTP and 0.03 pOF1213 
RNA at 30**C for 60 minutes. The reaction is terminated 
by the addition of cold trichloroacetic acid (TCA) and 
sodium pyrophosphate. The TCA-precipitable 
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radioactivity was then quantified to determine the 
extent of RdRp activity. 

In addition to the NS5B proteins derived from 
clones 4, 14, 21, 11, 16 and 20 of the invention, the 
5 NS5B gene of the sequence derived from the genotype la 
consensus sequence infectious clone as described by 
Kolykhalov et al. (GenBank Accession Number 7iF009606) 
was esqpressed and purified in the same manner as the 
proteins of the invention. Table 3 presents a direct 
10 comparison of the enzymatic activities of the NS5B 

proteins of the invention aind the consensus sequence 
protein of Kolykhalov. 

Table 3 

Coznparison of RdRp Activities of Various HCV NS5B 
15 . Proteins 



specific amino acid changes 

Protein activity* relative to clone 4 

clone 4 (SKQ ID NO: 2) 9.3 

20 clone 14 (SBQ ID NO: 8) 1.7 2 

clone 21 (SEQ ID NO: 9} 1.9 5 

clone 11 (SEQ ID NO: 10) 0.2. 3 

clone 16 (SEQ ID NO: 11) 0.1 4 

clone 20 (SEQ ID NO: 12) <0.1 3 

25 AF009606 0.9 71 



* pmole NTP incorporated per /zg NS5B protein per 60 minutes 
under standard reaction conditions. 

30 Surprisingly, a considerable range of enzymatic 

activities was observed among these distinct NS5B 
proteins . The NS5B protein derived from clone 4 
possessed the. greatest activity in this comparison, 
while the activities of clone 14 and 21 were moderate, 

35 those of clone 11 and 16 were minimal and that of clone 
20 was \ande tec table. The activity of the genotype la 
consensus NS5B protein of Kolykhalov was substantially 
less thsm that clone 4 of the invention. Lohmann et al. 
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reported a specific activity of their genotype lb NS5B 
protein on a heteropolymeric RNA template to be 1.7 
pmol/Mg/120 minutes, similar to the activity reported 
here for the Kolykhalov enzyme, but also significantly 
5 below that of the clone 4 enzyme. 

This exanple demonstrates a range of functional 
activities among cloned ctnd expressed NS5B sequences, 
from very active to inactive. In this analysis of 6 
NS5B proteins of the invention, there are three 
10 instances (clones 11, 16 and 20) of NS5B proteins that 
are poorly functional or non- functional and three 
instances (clones 4, 14 euid 21) of NS5B proteins that 
are functional. 

It is further revealed here that surprisingly very 
15 few changes in amino acid sequence can be sufficient to 
dramatically alter the enzymatic activity of the NS5B 
protein. While sequence changes within known or 
predicted active sites or conserved sequence motifs of 
an enzyme might be expected to effect an enzyme's 
20 activity, most amino acid changes among the 6 NS5B 
sequences of this comparison lie outside consensus 
motifs of RdRp enzyme identified by Koonin [J. Gen. 
Virol. 72:2197-2206 (1991)] and Poch et al. [EMBO J. 
8:3867-3874 (1989)]. The two exceptions to this are as 
25 follows. Clone 16 possesses a single residue chcinge 

within motif III and VI, and clone 21 possesses a single 
conservative amino acid change within motif VI (Figure 
1) . This notwithstanding, the effect of these amino 
acid changes, and of the others among the NS5B sequences 
30 of this example, on the RdRp activity of the NS5B 
protein is surprising and could not have been 
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anticipated from prior art. 

The data presented in this exanple also demonstrate 
that the NS5B RdRp activity derived from clone 4 has 
dramatically superior activity over that of the NS5B 
5 derived from the infectious clone identified by GenBank 
Accession Number AF009606. This result indicates that 
the mere generation of a consensus sequence of an NS5B 
protein does not necessarily provide an NS5B protein 
with optimal functionality. 

10 

EXAMPLE 4 
HCV NS5B Sequence Allgsments 

15 The NS5B secjuences of the invention are distinct 

from NS5B sequences previously characterized as 
fimctional. This is demonstrated by the sequence 
alignments presented in Figures 3, 4 and 5. 

The . sequence disclosed by Kolykhalov et al. 

20 (GenBank Accession Number AFO 09606) represents a 

consensus secjuence of known NS5B sequences derived from 
genotype la HCV. Figure 3 shows an alignment of the 
amino acid sequence of clone 4 (SEQ ID NO: 2) of the 
invention with that of this genotype la consensus 

25 sequence. There are 71 amino acid differences between 
these two sequences (excluding the initiator methionine 
residue in clone 4) . This demonstrates that the clone 4 
NS5B sequence of the invention is distinct from the 
genotype la consensus sequence. Alignment of this 

30 genotype la consensus sequence with all other sec[uences 
of the invention similarly demonstrates the distinct 
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nature of the sequences of the invention. Also included 
in the alignment in Figure 3 is the genotype la sequence 
disclosed in PCT WO 97/12033, the NS5B protein from 
which was demonstrated to possess RdRp activity, 
5 Inspection of this alignment reveals 72 amino acid 

differences between the sequences, again demonstrating 
the unique nature of the sequences of the invention. 

The sequences of the invention are also distinct 
from a genotype lb consensus sequence. A genotype lb 

10 consensus sequence was generated based on alignment of 
29 genotype lb NS5B sequences found in the GenBank 
datcibase. Comparison of this consensus secpience with ; 
that of. the clone 4 (SEQ ID NO: 2) of the invention' 
shows that these sequences are distinct (Figure 4). . 

15 After the initiator methionine (at position -1) , the 

clone 4 sequence differs from this genotype lb consensus 
sequence by 13 amino acids. Alignment of this genotype 
lb consensus sequence with all other sequences of the 
invention similarly demonstrates the distinct nature of 

20 the secjuences of the invention. Also included in the 
alignment in Figure 4 are the genotype lb sequences 
disclosed by Behrens et al. (GenBank Accession Number 
M58335) and by LohmcUin et al. (GenBank Accession Number 
Z97730) . The NS5B proteins encoded by these sequences 

25 have been previously demonstrated to possess RdRp 
activity. Clone 4 is again distinct in that it 
possesses 17 amino acid changes relative to M58335 and 
12 changes relative to Z97730. 

The sequences of the invention are further distinct 

30 from a genotype lb sequence derived from an infectious 
clone [Yanagi et al. Virology 244:161-172 (1998); 
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GenBank Accession Number AF054247] , As shown in 
alignment presented in Figure 5, the clone 4 sequence 
(SEQ ID NO: 2) differs by 25 amino acid residues, 
excluding the methionine residue at position Dl in clone 
5 4- 

These alignments reveal the unique nature of 
sequences of the invention relative to these genotype lb 
sequences . 

Therefore, based on comparisons with both genotype 
10 la and genotype lb consensus sequences, the NS5B 

sequences of the invention do not represent a consensus 
sequence. Thus, based on the consensus sequence method 
for deducing sequence functionality, it would not be 
apparent that clone 4 is a functional RdRp or a 
15 functionally superior RdRp. Moreover, based on 

comparisons with NS5B sequences previously demonstrated 
to have functional RdRp activity, the sequence of clone 
4 and of the other sequences of the invention are 
distinct from these RdRp sequences, and again, would not 
20 a priori be assumed to possess RdRp activity, and 
further not suspected of possessing superior RdRp 
activity. 



25 EXAMPLE 5 

HCV NS5B Proteins with Improved Activity 

Based on sequence comparisons among NS5B sequences 
of the invention, a number of modified NS5B sequences 
30 were constructed in which particular amino acids were 
changed. These modified sequences were generated by 
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Standard site- specif ic mutagenesis procedures [Picard et 
al. Nucleic Acid Res. 22:2587-2591 (1994)1. Modified 
genes were engineered in the baculovirus expression 
system as described in Example 1, purified according to 
5 Example 2 and evaluated for RdRp activity according to 
Exan5)le 4. . 

The sequence of clone 20 differs from that of clone 
4 at three amino acid residues. The presence of a 
valine (V) residue at position -1 of the clone 20 NS5B 
10 protein instead of an initiator methionine (M) suggests 
that the NS5B protein produced from clone 20 is 
initiated at the methionine at position 2, and fxirther, 
that protein produced from this methionine residue is 
inactive. The V change in clone 20 is likely an 
15 artifact of molecular cloning procediires (mistaken 

sequence in a primer oligonucleotide) . The V residue 
position -1 in clone 20 was changed to M, yielding clone 
20(V-1M), identified by SEQ ID NO: 13, which now differs 
from the sequence of clone 20 by one amino acid (at 
20 residue -1) and from clone 4 by two amino acids (at 
residues 177 and 543) . The RdRp activity of the 
purified NS5B protein derived from clone 20(V-1M) was 
dramatically increased over that of clone 20 and 
signif icantly improved over that of clone 4 (TaJDle 4) . 
25 Additional residue changes among the NS5B secpiences 

of the invention can lead to useful and functional 
proteins. For example, when the alanine (A) residue at 
position 75 in clone 4 is changed to valine (V) , the 
residue found at this position in clone 14, the 
30 resulting clone, clone (A75V) , shows considerable RdRp 
activity (Table 4) . In another example, the asparagine 
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(N) residue in clone 4 at position 177 was changed to 
aspartic acid (D) , the residue foimd at this position in 
all other clones of the invention, yielding clone 
4 {N177D) . This clone showed very good RdRp activity 
5 (Table 4) . In a third example, when the serine (S) at 

position 543 in clone 4 is changed to proline (P) , as is 
fotmd in clone 20, yielding clone 4(S543P), a functional 
RdRp is generated (Table 4) . 

Table 4 

10 RdRp Activities o£ Various HCV NS5B Proteins 

Protein# specific activity * 



clone 


20 


(SEQ ID NO: 


12) 


< 0. 


1 


clone 


20{V-1M) 


(SEQ ID NO: 


13) 


23. 


5 


clone 


4 


(SEQ ID NO: 


2) 


9. 


3 


clone 


4 (A75V) 


(SEQ ID NO: 


54) 


3. 


5 


clone 


4 (N177D) 


(SEQ ID NO: 


55) 


7. 


2 


clone 


4 (S543P) 


(SEQ ID NO: 


56) 


3. 


5 



20 

* pmole NTP incorporated per /xg NS5B protein per 60 minutes 
under standard reaction conditions. 

# All of the changes in table 4 may also be made to the other 
NS5B encoding amino acid sequences set forth herein i.e. SEQ 

25 ID NOS: 8, 9, 10, 11, 12, 13. and thus are contemplated to 

be within the scope of the invention. 

This example demonstrates that amino acid residue 
changes may be made in the sequence of the NS5B protein 

30 and that such changes may maintain the functional 

activity of the resulting protein. Moreover, the amino 
acid s\ibstitutions may be either conservative or non- 
conservative in nature. 

Additional changes of the amino acid residues among 

35 the NS5B sequences of the invention in various 

combinations are contemplated to be within the scope of 
the invention. 
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EXAMPIiE 6 

Novel HCV NS5B Proteins with Ixnproved Activity 

Another strategy for investigating functional NS5B 
5 seq[uences involves the introduction of unique amino acid 
stibsti tut ions .at positions where amino acid residues are 
conseirved in known NS5B sequences. For example, all ^ 
known NS5B sequences have as the amino terminal andno 
acid (position 1) a serine residue. Several modified, 

io versions of clone 4 were constructed in which this 

serine was substituted with either alouiine [clone ... ;.;.v^^ 
4(S1A)], glycine [clone 4(S1G)] or threonine [clone v/;/;,/ 
4 (SIT)]- Surprisingly, all of the NS5B proteins derived 
from these novel sequences exhibited substcoitial 'RdRjp <.Vv:; 

15 activity. However, substitution at this position with !. 
tyrosine (Y) [clone 4 (SlY) ] yielded an inactive NS5B 
protein (Table 5) . 

Similarly, all known NS5B sequences have a 
methionine residue at position 2 . Modified versions of 

20 clone 4 were constructed in which this methionine was 

substituted with either alanine [clone 4 (M2A) 1 , leucine 
[clone 4 (M2L) ] or threonine [clone 4 (M2T) ] . 
Surprisingly, all of these novel NS5B proteins exhibited 
substantial RdRp activity. 

25 In a final example, addition of an amino acid 

residue prior to the first residue of the mature NS5B . 
protein (serine at position 1) , as in clone 4 (MAS) , in 
which an alanine residue is inserted before the serine , 
residue, yields a substantially functional NS5B RdRp 

30 (Table 5) . 
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These findings, in which amino acid substitutions 
are made at residues for which there is no naturally 
occurring variation aind that result in the generation of 
fxinctional, as well as dramatically improved functional, 
5 NS5B proteins, were totally unanticipated. . 

Moreover, these data demonstrate that the nature of 
amino acid residue changes capable of giving rise to 
functional variants of NS5B is not limited to . . ' 

conservative amino acid substitutions. For exanple, 

10 while a substitution of threonine for serine at position 
1 Ccui be viewed as a conservative amino acid . ^ . 

substitution, the substitution with glycine represents a 
dramatic change in the chemical nature of -the amino acid 
side-chain. Similarly, substitution of the methionine , 

15 residue at position 2 with leucine represents a 
conservative change. However, replacement with 
threonine, a clear non-conservative change, yields a . 
fimctional and fiinctionally improved NS5B protein. 
Based on these discoveries, it is therefore fully 

20 contemplated here that additional changes of the amino 
acid residues among the NS5B sequences of the invention, 
both conservative and non- conservative in nature, at 
various residue positions and in various combinations 
will yield useful NS5B compositions and are therefore 

25 within the scope of the invention.. 



30 
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Table 5 

RdRp Activities of Various HCV NS5B Proteins 

5 Protein# specific activity * 

clone 4 (SEQ ID NO: 2) 9.3 

clone 4(S1G) (SEQ ID NO: 57) 23.0 

clone 4(S1A) (SBQ ID NO: 58) 3.5 

clone 4 (SIT) (SBQ ID NO: 59) 3.8 

10 clone 4(Siy) (SBQ ID NO: 60) <0.1 

Clone 4(M2A) (SEQ ID NO: 61) 14.0 

clone 4(M2L) (SEQ ID NO: 62) 13.0 

clone 4(M2T) (SEQ ID NO: 64) 22.3 

clone 4 (MAS) (SEQ ID NO: 65) 15.0 

15 

* pmole NTP incorporated per >ig NS5B protein per 60 minutes 

under standard reaction conditions. 

# while changes to SEQ ID N02 are exeinplified herein, the 
recited alterations may also be made to the other NS5B amino 

20 acid sequences, i.e. SEQ ID NOS: 8, 9, 10, 11, 12, 13. Thus, 

sequences so altered are also contemplated to be within the 
scope of the invention . 

25 While Exait5)les 5 and 6 demonstrate that various 

changes in the amino acid sequence of SEQ ID NO: 2 may 
be made to advantage, it is fully contemplated that the 
residue changes exeirplif ied here will also have similar 
utility, either singly or in various combinations, in 

30 other NS5B sequences, including sequences of the 

invention (e.g., 8, 9, 10, 11 and 13, those of Tables 4 
and 5), any natural allelic variants, mutants and 
derivatives of these and other NS5B sequences. That is 
to say, the chemges described in Exanples 5 cind 6, when 

35 introduced into any NS5B sequence, will likely provide a 
functional or fimctionally improved NS5B, and are fully 
contemplated to be within the scope of the present 
invention. 
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Several exanples of nucleic acid sequences encoding 
variant NS5B protein sequences useful in the practice of 
the invention comprise sequences provided in Table 6- 

Table 6 



Nucleic acid sequences encoding variant 1IS5B Proteins* 



Parental SEQ ID Codon Position/Change 



SEQ ID NO: 1 


A codon encoding aleoiine at position 75 to any 
codon encoding valine 


SBQ ID NO: 1 


A codon encoding asparagine at position 177 to any 
codon encoding aspartic acid 


SEQ ID NO: 1 


A codon encoding serine at position 543 to any codon 
encoding proline 


SEQ ID NO: 1 


A codon encoding serine at position 1 to any codon 
encoding glycine 


SEQ ID NO: 1 


A codon encoding serine at position 1 to any codon 
encoding alanine 


SBQ ID NO: 1 


A codon encoding serine at position 1 to any codon 
encoding threonine 


SEQ ID NO: 1 


A cpdon encoding serine at position 1 to any codon 
encoding tyrosine 


SEQ ID NO: 1 


A codon encoding methionine at position 2 to any codon 
encoding alanine 


SEQ ID NO: 1 


A codon encoding methionine at position 2 to any codon 
encoding leucine 


*SEQ ID NO: 1 


A codon encoding methionine at position 2 to any codon 
encoding threonine 


SEQ ID NO: 1 


Insertion of any codon encoding alanine at -1 



* While SEQ ID NO: 1 is exenqplified in this table, the 



alterations recited may also be introduced into other NS5B encoding 
nucleic acid sequences set forth herein, i.e., SEQ ID NOS: 3, 4, 5, 
6 and 7. Such altered nucleic acid sequences are also contemplated 
for use in the present invention. 



The exanples of Table 6 demonstrate various 
beneficial changes in the nucleic acid sequence of SEQ 
ID NO: 1. In view of these results, it is expected that 
the changes exemplified in SEQ ID N0:1 will also have 
similar utility, either singly or in various 
combinations, in other NS5B sequences, including 
sequences of the invention (e.g., 3, 4, 5, 6 and 7,), 
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any natural allelic varicints, mutants and derivatives of 
these and other NS5B sequences. That is to say, the 
changes described in this Table 6, when introduced into 
any NS5B sequence, will likely provide a functional or 
5 functionally improved NS5B, and are thus included in the 
scope of the present invention. 



EXAMPLE 7 

10 Utility of BC7 NS5B Protein for Discovery of 

Antiviral Campos itions 



The discovery of novel inhibitors of viral 
polymerases and related proteins often times requires 

15 the screening of large numbers of chemical compounds or 
mixtures of chemical compounds. Thus, an assay for 
polymerase activity that is capable of high volume 
screening, in other words, a high throughput assay, is 
desirable. There are a variety of assay methodologies 

20 well known to the trained artisan that allow the 

efficient screening of large numbers of samples [see, 
for exanple, Cole, JL, in Meth. Enzymology 275:310-328 
(1996)], and may utilize any number of activity 
detection and measurement technologies including, but 

25 not limited to, radiometric, colorimetric, fluorogenic, 
or chemi luminescent , any one of which may be suitable in 
the case of the HCV NS5B RdRp activity. 

In one approach, a high throughput assay of the 
RdRp activity of the functional HCV NS5B proteins of the 

30 invention is provided to encible the screening of large 
numbers of chemicals or other potential inhibitors for 
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activity against the enzyme. The assay is formatted in'.r 
96 -well microplates and measxires polymerase activity onV i 
an RNA template-primer by the incorporation of 
radiolabeled NTP into trichloroacetic acid 
5 (TCA) -precipitable RNA product. Radioactivity may be V^^^^ 
qucmtif ied by either direct scintillation spectrometry;" ; 
or phosphorimaging technology. . A phosphorimage of assay , 
results obtained with the NS5B protein of Example 4 . ; ■ 
, [clone 4] for one screening plate is presented in Fig. ; 
10 '6. The first (1) and last (12) columns of the pia:te ' 

. contain activity and background controls and a titration! 
'•,^r:^-^ot a reference inhibitor .compound. . Thus; wells AlV BlV^^^^^^^^^ 
\[\. ' 'l;-}i'A12 emd 'B12'' show"' the "activity of the * enzyme " an the 
absence of any test compound (100% of the expected 
15 activity). In columns 1 and 12, wells C through F, a j';. 
compound that was discovered to inhibit the HCV RdRp . . 
activity by use of the methods of the invention is 
included in decreasing concentrations. Wells HI and H12 
lack the HCV enzyme and illustrate the background (0% 
20 RdRp activity) in the assay. In this exanple, the 

remaining 80 wells contain a collection of small organic 
compounds that were tested for their ability to affect 
the RdRp activity of the HCV NS5B protein. Thus, it can 
be seen that the material in wells G2, DIO and FIO 
25 represent potent inhibitors of the HCV RdRp activity. - 
This exanple demonstrates that HCV NS5B proteins . 
may be used to advantage to identify and assess agents 
or materials that may affect the HCV RdRp and HCV 
replication. 

30 In summary, the disclosure of the present invention 

demonstrates that simple knowledge of an HCV NS5B 

'- " * ■ '- -.* 

• ■ ■ ■ •.'•7' 
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secpience'is insufficient to allow one to conclude, or 
even reliably predict that such sequence is a functional 

. sequence or has functional utility. Additionally, an 
HCV NS5B consensus sequence is not the only, cuid not 
necessarily the optimal;; functional NS5B protein . - 

' encoding sequence- The present invention provides novel i 

' NS5B sequences with uneuiticipated functionality. . ^ . / ; 
Additionaiily, the present invention djemohs^ 
particular sequence cheoiges in HCV-NS5B can : lead to 
unanticipated and significantly improved functionality. 
In light of the foregoing, the NS5B sequences of the 

. invention represent novel sequences ; with demonstrated :;. 
: functionality Eirid utility - aiid unique : in5)royemen1:s overw ; 
prior art. Finally, the HCV NS5B sequences of the . . / v 
invention have broad utility in research, diagnostic, ' 
therapeutic and pharmaceutical applications. 

While certain embodiments of the present invention 
have been described cuad/or exemplified above, various 
other embodiments will be apparent to those skilled in 
the art from the foregoing disclosure. The present 
invention is, therefore, not limited to the particular 
embodiments described and/or exemplified, but is capable 
of considersdDle modification without departure from the 
scope of the appended claims. 
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1. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein comprising the sequence of SEQ ID NO: 

5 1 cuid natiiral allelic variants, mutants and derivatives 
thereof . . 

2. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein having a sequence of SEQ ID NO. 2. 

10 

3. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein comprising the sequence of SEQ ID NO: 

3 and natural allelic variants, mutants and derivatives 
thereof . 

15 

4. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein having a sequence of SEQ ID NO: 8 

5. A nucleic acid molecule encoding a hepatitis C 
20 virus NS5B protein comprising the sequence of SEQ ID NO: 

4 and natural allelic variants, mutants and derivatives 
thereof . 

6. A nucleic acid molecule encoding a hepatitis C 
25 virus NS5B protein having the sequence of SEQ ID NO: 9. 

7. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein comprising the sequence of SEQ ID NO: 

5 and natural allelic variants, mutants and derivatives 
30 thereof. 
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8. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein having the sequence of SEQ ID NO: 10. 



9. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein comprising the sequence of SEQ ID NO: 

6 and natural allelic variants, mutants and derivatives 
thereof . 

10. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein having the sequence of SEQ ID NO: 11. 

11. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein comprising the sequence of SEQ ID NO: 

7 cuid natural allelic variants, muteuits and derivatives 
thereof . 

12 . A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein having the sequence of SEQ ID NO: 12. 

13. A nucleic acid molecule encoding a hepatitis C 
virus protein, said molecule having a nucleic acid 
sequence selected from the group consisting of SEQ ID 
N0:1, SEQ ID N0:3, SEQ ID N0:4, SEQ ID NO: 5, SEQ ID 
N0:6, and SEQ ID NO: 7 and natural allelic variants, 
mutants and derivatives thereof. 

14. A nucleic acid as claimed in claim 13, said 
nucleic acid molecule being modified as set forth in 
Table 6. 

15. A nucleic acid molecule encoding a hepatitis C 
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virus NS5B protein, said protein having an amino acid 
sequence selected from the group consisting of SEQ ID 
NO: 2, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID 
NO: 11, SEQ ID NO: 12 and SEQ ID NO: 13. 

16. A hepatitis C virus NS5B protein having a 
sequence selected from the group consisting of SEQ ID 
NO: 2, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID 
NO: 11, SEQ ID NO: 12 and SEQ ID NO: 13. 

17. An HCV NS5B protein as claimed in claim 16, 
said protein being modified as set forth in Tables 4 and 
5. 



15 18. A vector comprising a nucleic acid sequence 

encoding a hepatitis C virus NS5B protein, said protein 
consisting essentially of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 2, SEQ 
ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, 

20 SEQ ID NO: 12, and SEQ ID NO: 13. 

19. A vector as claimed in claim 18, said vector 
comprising a nucleic acid encoding a hepatitis C NS5B 
protein, said amino acid sequence of said protein being 

25 altered as set forth in Tables 4 and 5. 

20. A hepatitis C virus NS5B protein comprising 
the sequence of SEQ ID NO: 2. 



30 



21. A hepatitis C virus NS5B protein comprising 
the sequence of SEQ ID NO: 8. 
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22. ' A hepatitis C virus NS5B protein comprising 
the sequence of SEQ ID NO: 9- 

23 . A hepatitis C virus NS5B protein comprising 
5 the sequence of SEQ ID NO: 10 . , 

24. A hepatitis C virus NS5B protein comprising 
the sequence of SEQ ID NO: li. 

10 25. A hepatitis C virus NS5B protein comprising 

the secjuence of SEQ ID NO: 12. 

• • . 26 : A hepatiti^ C virus J^S5B . protein comprising 

■ * the sequence of SEQ ID NO: 13. 

27. A hepatitis C virus NS5B protein having an 
amino acid secpience selected from the group consisting 
of SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 
10, SEQ ID NO: 11, SEQ ID NO: 12, and SEQ ID NO: 13, 

20 wherein the amino acid residues at positions +1 and +2 
may be any amino acid. 

28. A hepatitis C virus NS5B protein as claimed in 
claim 27, wherein said amino acid sequence is altered as 

25 . set forth in Tables 4 and 5. 

29. A nucleic acid molecule encoding a hepatitis C 
virus NS5B protein in which the amino acid at position 
177 of the mature polypeptide is asparagine. 
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30. A hepatitis C virus NS5B protein in which the 
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amino acid at position 177 of the mature polypeptide is /. 
asparagine. 

'••*', .},•••.''' 

31- A nucleic acid molecule as claimed in claim 1/ 
wherein the codon encoding an amino acid at position 75 
is chcuiged from alanine to valine. \ \?;> - ! 

32 J A hepatitis C virus NS5B protein as claimed ^^1^^^^ 
claim 20, wherein the amino acid at position 75 is" 
' changed from alanine to valine;' 



33 . . A nucleic acid molecule as claimed in claim IV 
yWherein the an amino acid at' " posit ion 

is changed from asparagine to aspartic acid: • • ■.::;'--^'\r 

34. A hepatitis C virus NS5B protein as claimed in 
claim 20, wherein the amino acid at position 177 is 
changed from asparagine to aspartic acid. 

35. A nucleic acid molecule as claimed in claim 1, 
wherein a codon encoding an amino acid at position 534 
is changed from serine to proline. 

36. A hepatitis C virus NS5B protein as cla.imed in 
claim 20, wherein the amino acid at position 534 is 
changed from serine to proline. 

37. A nucleic acid molecule encoding a HCV NS5B 
protein having a sequence selected from the group 
consisting of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID NO:4, SEQ 
ID NO: 5, SEQ ID NO: 6, and SEQ ID NO: :7, wherein the 
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.^codons " encoding amino acids at positions -1/ -2, and -3. 
may encode any amino acxd. 

38. A nucleic acid molecule as claimed in claim 

., 5 .13, further comprising a jtag sequence . ' , o'\ 

*.:•• - * '! ■ ■••.*'.* 

39. A nucleic acid ^molecule as qlaimed/in claim ; 

* 38, said tag sequence being selected from the group » v-^ 
consisting of homopolymeric nucleic acid sequences, J^^^^ 
10 polyhistidine, flag epitope, c-myc epitope, -; 

transmembrcuie epitope of the influenza A virus ' 
hemaglutinin. protein, , protein ^ cellulose jbindi 
V^V ; • domain; calmodulin binding protein/ v/ m^ 

' protein, chitin binding dbmaiil, glutaithione 
;;15 S-transferase, or bio tin. 

40. A nucleic acid molecule as claimed in claim 
14, further comprising a tag sequence - 

20 41. A nucleic acid molecule as claimed in claim 

40, said tag sequence being selected from the group 
consisting of homopolymeric nucleic acid sequences, 
polyhistidine, flag epitope, c-myc epitope/, 
transmembrane epitope of the influenza A virus 

25 hemaglutinin protein, , protein A, cellulose binding 

domain, calmodulin binding protein, maltose binding • •: 
protein, chitin binding domain, glutathione ■ 
S-transferase, or biotin'.-; r]-', \ v/'"' 
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42- A hepatitis C virus NS5B protein selected from 
the group consisting of SEQ ID NO: 2, SEQ ID NO:. 8, SEQ 



71 



wo 99/51781 PCTAJS99/07404 

ID NO: 9, SBQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12 
I and SEQ ID NO: 13 and a modified NS5B protein as set 
forth in Tables 4 and. 5, further coitqprising a protein 
tag sequence, said tag sequence being selected from the 
.group consisting of homopolymeric nucleic acid 
sequences, polyhistidine, flag epitope, c-myc epitope,, 
transmembrane epitope of the influenza A virus 
hemaglutinin protein, , protein A, cellulose binding 
domain, calmodulin binding protein, maltose binding 
protein, chitin binding domain, glutathione 
S- transferase, or biotin. 

43. A nucleic acid molecule encoding a hepatitis C 
virus protein having a nucleic acid sequence selected 
from the group consisting of SEQ ID N0:1, SEQ ID NO: 3, 
SEQ ID N0:4, SEQ ID NO: 5, SEQ ID N0:6, and SEQ ID NO: 
7, natural allelic variants, mutants and derivatives of 
SEQ ID NO:l, SEQ ID N0:3, SEQ ID N0:4, SEQ ID NO: 5, SEQ 
ID NO: 6, and SEQ ID NO: 7 and nucleic acids modified as 

I set forth in TedDle 6, wherein an amino acid codon for 
alanine precedes the first amino acid codon of the 
mature NS5B protein. 

44. A hepatitis C virus NS5B protein having a 
sequence selected from the group consisting of SEQ ID 
NO: 2, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID 

I NO: 11, SEQ ID NO: 12 and SEQ ID NO: 13 and a modified 
I NS5B protein as set forth in Tables 4 and 5, wherein an 

alanine residue precedes the first amino acid of the 

mature NS5B protein. 

45. A method for assaying a test conpotind, for a 
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modulating activity against hepatitis C virus 
comprising: 

i) providing an enzymatically active 
hepatitis C virus NS5B protein comprising a sequence 

5 selected from the group consisting of SEQ ID NO: 2, SEQ 
ID NO: 8, SEQ. ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, 
and SEQ ID NO: 13; 

ii) contacting said protein with a test 
compound suspected of modulating hepatitis C virus NS5B 

10 activity; and 

iii) measuring modulation of said hepatitis C 
virus NS5B activity by said test compound. 

46. A method as claimed in claim 45, wherein said 
15 HCV NS5B protein sequence is modified as set forth in 

Tables 4 and 5. 

47. A method as claimed in claim 45, wherein the 
amino acids at positions +1 and +2 in said HCV NS5B 

20* protein may be any amino acid. 

48. A method as claimed in claim 45, wherein amino 
acids at positions -1, -2 and -3 of said HCV NS5B 
protein may be any amino acid. 



25 



30 



49. A method as claimed in claim 45,. wherein said 
HCV NS5B protein further comprises a tag sequence. 

50. A method for assaying a compound, for 
antagonistic activity against hepatitis C virus 
comprising: 
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i) providing an enzymatically active 
hepatitis C virus NS5B protein con^rising a sequence 
selected from the group consisting of SEQ ID NO: 2, SEQ 
ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, 

5 and SEQ ID NO: 13; 

ii) contacting said protein with a test 
compound suspected of antagonizing hepatitis C virus 
NS5B activity; and 

iii) measuring antagonism of said hepatitis C 
10 virus NS5B activity by said test compound. 



15 



51. A method as claimed in claim 50, wherein said 
HCV NS5B protein sequence is modified as set forth in 
Tables 4 and 5- 

52. A method as claimed in claim 50, wherein the 
amino acids at positions +1 and +2 in said HCV NS5B 
protein may be any amino acid. 



20 53. A method as claimed in claim 50, wherein said 

HCV NS5B protein further comprises a tag sequence. 

54. A method as claimed in claim 50, wherein amino 
acids at positions -1, -2 and -3 of said HCV NS5B 

25 protein may be any amino acid. 

55. A method for assaying a test compound, for 
agonistic activity against hepatitis C virus comprising: 

i) providing an enzymatically active 
30 hepatitis C virus NS5B protein comprising a sequence 

selected from the group consisting of SEQ ID NO: 2, SEQ 
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ID NO: SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, 

and SEQ ID NO: 13; 

ii) .contacting said protein with a test 
compound suspected of agonizing hepatitis C virus NS5B 

5. activity; and " • 

iii) .measuring agonism of said hepatitis C 

• . ■ ^" -. ' , . • * .* ' ■ ■ 

virus NS5B activity by said test compound. 

56. A method as claimed in claim 55^ wherein said 
10 HCV NS5B protein sequence is modified as set forth in '•' [ 
Tables 4 and 5. 

= 57. • A method as claimed, in claim 55, wherein the ; 

amino acids at positions +1 and +2 in said HCV NS5B 
15 protein may be any amino acid. 

58. A method as claimed in claim 55, wherein amino 
acids at positions -1, -2 and -3 of said HCV NS5B 
protein may be any amino acid. 



20 



59. A method as claimed in claim 55, wherein said 
HCV NS5B protein further comprises a tag sequence. 



60. A method for assaying a test compound for 
25 interaction with the hepatitis C virus NS5B protein 
sequence comprising: 

i) providing a hepatitis C virus NS5B 
polypeptide consisting essentially of a sequence 
selected from the group consisting of SEQ ID NO: 2, SEQ 
30 ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, 
SEQ ID NO: 12, and SEQ ID NO: 13; 
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ii) contacting said protein or peptide with a 
test compound suspected of interacting with. the 
hepatitis C virus NS5B protein or peptide;, and : 

iii) measuring the interaction of said test. ^ 
5 coicqpound or material with said hepatitis C virus NSSB. ';/ 

polypeptide. |. *' • 

61. A method as claimed in claim 60^ -wherein said ;^ 
HCV NS5B protein sequence is modified as set forth in 

10 "V 'Tables 4 and 5.*V^ ■ ;"^*^=v * l"^v/> - -^-'^t- 

62. A method as claimed , in claim 60, - wherein ,, the': .^^^^ 
; VvUaminb acids at p>ositions.:;+l and +2 in saidj'HCV NS^^^ 

• protein may be any amino acid. ' Y 

15 ;v ■■■ "■ ■ vVv^'^S 

63. A method as claimed in claim 60, wherein amino 
acids at positions -1, -2 and -3 of said HCV NS5B 
protein may be any amino acid. 

20 64. A method as claimed in claim 60, wherein said 

HCV NS5B protein further comprises a tag sequence. 

65 i A method for assaying a test compound, for 
interaction with a hepatitis C virus NS5B nucleic. acid 
25 sequence comprising: . 

i) providing a hepatitis C virus NS5B nucleic 
acid consisting essentially of a sequence selected from 
the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ 
ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, and SEQ ID NO: 7; 
30 ii) contacting said NS5B nucleic acid with a 

test compound suspected of interacting with the : . 
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hepatitis C virus NS5B NS5B nucleic acid; and ... . 

iii) measuring the interaction of said test 
compound with said hepatitis C virus NS5B NS5B nucleiic 
acid. 

5 i' ''\ * .^ . . ;V ; ' 

. i. . 66. A method as claimed in claim (55, wherein said 

HCV NS5B nucleic acid sequence is modified, as set forth. 
' in Table 6. • ' ' ' ; ;vhv ' V^-l * 'M-^^ 

10 67. A method for'detecting the presence of HCV in 

a biological sample, comprising nucleic acid 
. amplification that effectively amplifies a selected 
/ .n sequence from said virus and '.)'' '" 

. said selected sequence; said nucleic acid having a ; ' 
15 ; . sequence selected from the group consisting of SEQ ID 
. NO: 1, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO; 5, SEQ ID 
NO: 6, and SEQ ID NO: 7. 

68. A method as claimed in claim 67, said nucleic 
20 acid sequence being altered as set forth in Table 6. 

69. A method as claimed in claim 67, wherein the 
nucleic acid amplification method is performed employing 
a series of nested primers. 

25 . 

70. A method as claimed in claim 67, wherein said 
sequence is detected using an oligonucleotide comprising 
a sequence set forth in Table 1. 

30 71. A method for detecting immunological . 

interactions between viral polypeptides and antibodies 




SUBSnrUTE sheet (rule 26) 



wo 99/51781 PCT/US99y07404 

directed toward hepatitis C virus in a biological 
sample, said method comprising isolating said antibodies 
using an NS5B aiaino acid sequence selected from the 
group consisting of SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID 
5 . ^ NO: 9, SEQ -ID NO: 10^ SEQ ID NO: 11, SEQ ID NO: 12, and 
*: ' SEQ ID NO: 13. 

72. A method as claimed in claim 71, wherein said 
HCV NS5B amino acid sequence is modified as set forth in 

10 Tables 4 and 5. 

73. A method as claimed in claim 71, wherein the 
.••amino acids at positions +1 and +2 in said HCV NS5B 

protein may be any amino acid. 



15 



74. A method as claimed in claim 71, wherein amino 
acids at positions -1, -2 and -3 of said HCV NS5B 
protein can be any amino acid. 



20 75. An antibody having affinity for a hepatitis C 

polypeptide having a sequence selected from the group 
consisting of SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID NO: 9, 
SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, and SEQ ID 
NO: 13. 

25 

76. An antibody as claimed in claim 75, wherein 
said HCV NS5B protein sequence is modified as set forth 
in Tables 4 and 5. 

30 77. An antibody as claimed in claim 75, wherein 

the amino acids at positions +1 and +2 in said HCV NS5B 
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protein may be any amino acid. 



78. An antibody as claimed in claim 75, wherein 
amino acids at positions -1, -2 and -3 of said HCV NS5B 

5 protein may be any amino acid. 

79. An antibody having affinity for a polypeptide 
comprising a sequence set forth in Table 2. 

80. A nucleic acid molecule encoding a hepatitis C 
virus protein having a nucleic acid sequence selected 
from the group consisting of SEQ ID N0:1, SEQ ID N0:3, 
SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, and SEQ ID NO: 
If natural allelic variants, mutants and derivatives of 
SEQ ID N0:1, SEQ ID N0:3, SEQ ID N0:4, SEQ ID NO: 5, SEQ 
ID NO: 6, and SEQ ID NO: 7 and nucleic acids modified as 
set forth in Table 6, which encodes a viral antigen. 

81. A method for introducing a nucleic acid as 
claimed in claiia 80 into a host, wherein said nucleic 
acid is delivered to said host via a process selected 
from the group consisting of transformation, 
transfection, transduction, transgenetics, surgically 
and by physical bombardment with nucleic acid-coated 
particles . 

82. A viral antigen comprising an amino acid 
sequence 

selected from the group consisting of SEQ ID NO: 2, SEQ 
30 ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, 
SEQ ID NO: 12 and SEQ ID NO: 13 and a modified NS5B 
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protein as set forth in Tables 4 and 5 

83. A viral antigen as claimed in claim 82, 
wherein the amino acids at positions +1 and +2 in said 

5 HCV NS5B protein may be any amino acid. 

84. A viral antigen as claimed in claim 82, 
wherein amino acids at positions -1, -2 and -3 of said 
HCV NS5B protein may be any amino acid. 

10 

85. A viral antigen as claimed in claim 82, 
further comprising a protein tag sequence. 

86. A method of raising an immune response in a 
15 mammalian subject comprising administering to said 

subject a viral antigen encoded by the nucleic acid of 
claim 80. 

87 . A method of raising an immune response in a 
20 mammalian subject comprising administering to said 

subject a viral antigen according to claim 82. 

88. A method of amplifying hepatitis C virus 
nucleic acids employing primers comprising sequences 

25 selected from the oligonucleotide sequences set forth in 
Table 1. 

89. A host cell comprising a hepatitis C virus 
nucleic acid as claimed 13. 



30 



90. A host cell comprising a hepatitis C virus 
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* . ' ' - . . • ' * 

nucleic acid, said nucleic acid being altered as set 
. forth in Table 6 . . 
" • • • ' » 

91. A host cell comprising a nucleic acid encoding 
5 a hepatitis C NS5B protein/said nucleic acid being 

selected from, the group cohsisting of SEQ ID N0:1, SEQ 
ID N0:3^ SEQ ID N0:4, SEQ ID NO: 5/ SEQ ID NO: 6, and SEQ 
ID NO: 7, natural allelic variants, mutants and 
derivatives of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID NO: 4, 
10 SEQ ID NO: 5, SEQ ID N0:6/ and SEQ ID NO: 7 and nucleic 
acids modified as set forth in Table 6, said host cell 
being selected from the group of cells consisting of 

)i ,/ < .V fungal cells,, insect cells, mammalian 

!; V; . cells, and plant cells. 

:-;i5 ■■ 

92. A cell line comprising a nucleic acid encoding 
a hepatitis C NS5B protein, said nucleic acid being 
selected from the group consisting of SEQ ID N0:1, SEQ 
ID N0:3, SEQ ID N0:4, SEQ ID NO: 5, SEQ ID N0:6, and SEQ 

20 ID NO: 7, natural allelic variants, mutants and 

derivatives of SEQ ID N0:1, SEQ ID NO: 3, SEQ ID N0:4, 
SEQ ID NO: 5, SEQ ID NO: 6, and SEQ ID NO: 7 and nucleic 
acids modified as set forth in Table G, said cell line 
being selected from the group consisting of a hepatocyte 

25 . cell line, Chang liver cells, Hela cells, U937 cells, . 

HepG2 cells, MT-4 cells and clonal cells generated from 
said cell lines. 

93. A cell line according to claim 92, wherein 
.30 said cell line expresses a functional NS5B protein. 

94 . A cell line according to claim 93 further 
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comprising a reporter system wherein a reporting signal., 
which from said reporter system is dependent on the 
polymerase activity of a functional HCV NS5B protein' in , ; 
said cells. 

95. A cell line according to claim 94 , whierein .'.V 
said reporter system includes a nucleic acid comprising 

. the antisense strand of an RNA molecule encoding a ;.: 
functional reporter molecule, said NS5B polymerase. * 
10 • . acting on said antisense molecule to generate 3i'''\'''):S}^i- '' 
. , translatable mRNA sequence in said host cell, said . ; . ■ 
■ translated reporter molecule sequence being capable pf y 

: \ ':VV pJ^oducing .an increase /.in signal which is: indicative pfv).- 
the presence of a functional NS5B polymerase. .V ;V; : , 

i6/;^V ■ / • ' ' ' " ' v^;' 

96. A cell line according to claim 95 wherein said 
functional mRNA molecule encodes a reporter molecule 
selected from a group consisting of luciferase, p- 
galactosidase, alkaline phosphatase or fluorescent green 

20 protein hdst cell is a hepatocyte. 

97. A method for assessing the functionality of a 
hepatitis C NS5B protein, comprising: 

i) providing a cell line as claimed in claim 

25 92; 

ii) transfoming said cell line with a 
nucleic acid encoding an antisense molecule, the 
complement of which encodes a functional reporter 
molecule, said NS5B polymerase acting on said antisense 

30 molecule to generate a translatable mRNA sequence in . 
. said host cell; and . , • ; 
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;iii) assessing said cells for ah increase in - 
signal produced by said translated reporter molecule 
, sequence, ! said increase indicating the presence of a • 
functional NS5B protein. 

98. A method as claimed in claim 97/ wheriein said 
functional reporter molecule is selected from a group ^ ^ 
y. :.' consisting of lucif erase, p-galactosidase, . alka 
phosphatase or fluorescent grieen protein. " 



10 



99. A host cell as claimed in claim 89,. wherein 
said host cell is a hepatocyte. - \ ^.i^^ 

' "-'yy'-y y 'lOO. k method for assaying a test compound for • '^^^^^^^^ 
15 antagonist activity against hepatitis C virus 
\ comprising: V 

i) providing a cell line according to claim 

93; 

ii) contacting said cell line with a test 
20 compound suspected of having antagonistic hepatitis C 

virus NS5B activity; and 

iii) measuring the antagonistic effect, if 
any, of said test compound on hepatitis C virus NS5B 
activity. 

25 ' ■ • : ' - • - ' ■ ' 

101. A method for assaying a test compound for . ;/ 
agonist activity against hepatitis C virus comprising: 
i) providing a cell line according to claim 
93; • . - ; • 

30 . ' ii) contacting said cell^ line with a test • 

compound suspected of having agonistic hepatitis C virus 
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NS5B activity; and 

iii) measuring the agonistic effect^ if any, 
of said test compound on hepatitis C virus NS5B 
activity. 

. 5 _ 

102. A method of preparing a hepatitis C virus 
NS5B protein comprising a sequence selected from a group 
consisting of SEQ ID NO: 2, SEQ ID NO: 8, SEQ ID NO: 9, 
SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, and SEQ ID 

10 NO: 13 comprising: ^ 

i) expressing said protein in a biological 

system; 

ii) solubilizing said protein; 

iii) extracting said protein 'and 

15 iii) enriching for said NS5B protein* 

103. A method as claimed in claim 102, wherein 
said hepatitis C virus protein has been altered as set 
forth in Tables 4 and 5. 

20 

104. A method as claimed in claim 102, wherein the 
amino acids at positions +1 and +2 in said HCV NS5B 
protein can be any amino acid. 

25 105. A method as claimed in claim 102, wherein 

amino acids at positions -1, -2 and -3 of said HCV NS5B 
protein may be any amino acid. 

106. A inethod as claimed in claims 102, wherein 
.30 said HCV NS5B further comprising a protein tag sequence 
according to claim 40. 



84 



SUBSTITUTE SHEET (RULE 26) 



wo 99^1781 



PCTAJS99/07404 



107. A method as claimed in claim 102, wherein 
said NS5B protein is enriched by a process selected from 
the group consisting of cell disruption, cell 
5 fractionation, partitioning, reverse micelle 
partitioning, aqueous two-phase extraction, 
precipitation, chromatography, ion exchange 
chromatography, chelation, chromatography, affinity 
chromatography, immunoaf f inity chromatography, high 
10 pressure liquid chromatography, hydrophobic interaction 
chromatography, centrif ugation, membrane filtration, gel 
filtration, immunoprecipitation, electrophoresis, 
isoelectric focusing, and isotachophoresis. 

15 108. A method as claimed in claim 102, wherein 

said biological system is selected from the group 
consisting of bacterial cells, fungal cells, insect 
cells, mammalian cells, and plant cells. 

20^ 109. A method as claimed in claim 102, wherein 

said biological system is cell free. 

110. A method for isolating an HCV NS5B protein, 
comprising: 

25 i) contacting an extract of a biological 

system containing the NS5B protein with a solid matrix 
possessing a diethylaminoethyl (DEAE) moiety as a 
functional group in a salt-containing buffer solution; 

ii) removing said NS5B protein from said solid 

30 matrix and further contacting said protein with a solid 
matrix comprising heparin as a functional group; 
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iii) exposing said heparin matrix containing 
bound NS5B protein to a buffer solution of continuously 
increasing levels of salt; 

iv) collecting NS5B protein-containing 

5 fractions and contacting said collected fractions with a 
solid matrix containing Cibacron Blue as a functional 
group; 

v) exposing said Cibacron Blue matrix 
containing bound NS5B protein to a buffered solution 

10 containing increasing levels of salt; and 

vi) collecting said enriched NS5B containing 

fractions . 

111. A method of generating an infectious viral 
15 vector, comprising incorporating a hepatitis C virus 

NS5B nucleic acid selected from the group consisting of 
SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, and SEQ ID NO: 7 into said viral vector. 

20 112. A method as claimed in claim 111, wherein 

said nucleic acid sequence is altered as set forth in 
Table 6. 

113. A method of generating an infectious viral 
25 vector comprising substituting an HCV NS5B homologous 
gene sequence derived from any HCV sequence with a 
hepatitis C virus NS5B nucleic acid selected from the 
group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID 
NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, and SEQ ID NO: 7 in 
30 said viral vector. 
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114. A method as claimed in claim 113, wherein 
said nucleic acid sequence is altered as set forth in 



Table 6. 



: .10 



5 





sequence selected from the group consisting of SEQ ID 
NO: 2, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID 
NO: 11, SEQ ID NO: 12 and SEQ ID NO: 13 and a modified 
NS5B protein as set forth in Tables 4 and 5, and 
20 selecting a clone from said library which expresses the 
reactive antibody and isolating said clone. 

117. A method for propagating HCV in an in vitro 
system comprising culturing a cell line in which 
25 functional HCV NS5B sequences are provided, said 

' sequences being selected from the group consisting of 
SEQ ID NO: 1, SEQ ID N0:3, SEQ ID N0:4, SEQ ID NO: 5, SEQ 
ID N0:6, and SEQ ID NO: 7, natural allelic variants, 
mutants and derivatives of SEQ ID N0:1, SEQ ID N0:3, SEQ 
30 ID N0:4, SEQ ID NO: 5, SEQ ID N0:6, and SEQ ID NO: 7 and 



nucleic acids modified as set forth in Table 6, under 
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conditions that facilitate the replication of HCV. 



118. A method for infecting a cell culture with 
hepatitis C virus comprising administering a viral 
vector which is generated as claimed in claim 111 • ; \ 

119. A method for infecting an animal, with ^^r- .y:.';/' 



10. 



hepatitis C virus comprising administering a. viral 
vector which is generated as claimed in claim 111. ' 



120. A method of propagating hepatitis C virus in 
• vitro con^risihg: . ; 'J : 

.'•V:- ■ Vrf-r contacting a "ceil^'line* with:'a:;func 

HCV NS5B protein consisting essentially of a sequence , 
;15 selected from the group consisting of SEQ ID NO: 2/ SiEQ -y 
ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11/ 
SEQ ID NO: 12, . and SEQ ID NO: 13; and 

ii) said cell line being further contacted 
with an HCV nucleic acid. 

20 

121. A method as claimed in claim 120, wherein 
said NS5B protein has been modified as set forth in 
Tables 4 and 5 

25 122. A method as claimed in claim 120, wherein 

the amino acid residues at positions +1 and +2 in said 
NS5B protein may be any amino acid. 

123. A method according to claim 120 wherein said 
30 NS5B nucleic acid is provided within a hepatitis C virus 
repiicon. ; ' . ' 
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124 . . A method of propagating hepatitis C virus . in a 
living host, comprising: 

i) delivering a nucleic acid ! encoding a * 
hepatitis C virus NS5B protein having a sequence 
5 : selected from the group consisting SEQ ID NO:. 2/ SEQ ID 
NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO:' 11/ SEQ . 
ID NO: 12, and SEQ ID NO: 13 to a living host/ said. . 
• . nucleic acid encoding a functional NS5B protein anid • 

being operably linked to an expression oper on; and ; • 
10 ii) contacting said living host with an HC^ 

molecule selected from the group consistingi of HCV . 
particles, HCV nucleic acids, and HCV repiicpns. j\ / 

' . 125. A method as claimed in claim 124,'": 
15 -. said NS5B encoding nucleic acid is altered as "set forth ? 
in Table 6. 

126. A method as claimed in claim 124, wherein said 
nucleic acid is delivered to said host via a process 
20 selected from the group consisting of transformation, 

transfection, transduction, transgenetics, surgically or 
by physical bombardment with nucleic acid-coated 
particles. 

25 127. A method according to claim 124 wherein said 

NS5B encoding nucleic acid of step i) is provided within 
a replicon. 

128. A kit for detecting the presence of HCV 
30 nucleic acids in a biological sample, compriising: 

i) oligonucleotides comprising a sequence set 
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forth in Table I, said oligonucleotides being 
hybridizable to said HCV encoding nucleic acid; 

ii) reaction buffer; and 

iii) an instruction sheet. 

5 . •:. . . 

129. A kit as claimed in claim 128, wherein said 
oligonucleotide contains a tag. 

• 130. A kit for detecting the presence of HCV in a 
0 biological sample, comprising: ^* 

i) antibodies immunologically specific for 
HCV proteins; 

. ii) a solid support with immobilized HCV 
antigens as a positive control; and 
5 iii) an instruction sheet. 

131. A kit as claimed in claim 130, wherein said 
antibody contains a tag. 
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ICSMSYTHTGAL ITPCAAEESK LPZHALSNSL LRHHNMVYAT TSRSASQRQK 50 Clone 4; SEQ ID NO: 2 

Clone 14; SEQ ID NO: 8 

Clone 21; SBQ ID NO: 9 

Clone 11; SEQ ID NO: 10 

R Clone 16; SEQ ID NO: U 

V . Clone 20; SEQ ID NO: 12 

KVTFDRLQVL DDHYRDVLKE HKAKASTVKH KLLSVEEACR LTPPHSARSK 100 

V " 



FGYGAKDVRN LSSKAVHIIIH SVWKDLLEDT ETPIDTTIMIl XNEVFCVQPE 150 



II III 
R06RXPARLI VYPDLGVRVC BKHALYNWS TLPQAVMGS5 YI^FQYSPGQR 200 

D-I 

D- H 

IV/A 

VEFLVNAWKS KRCPHCFAYD TRCFOSTVTE NDIRVEE5IY QCCDIAPEAR 250 



V/B 

QAIRSLTERL YIGGPLTOSK GQSCGYRRCR ASGVLTTSCG NTLTCYLKAS 300 



VI/C 

AACRAAXLQD CTMLVCGDDL WICESAGTQ EDAASLRVFT EMfTRTSAPP 350 



VII/D VIII 
I 1 I 1 

GDPPQPEYDL ELITSCSSNV SVAHOASGKR VYYLTRDPTT PLARAAWETA 400 



Fig.lA 
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RHTPVN5HLC NIXmAFTIM ARHIUITHFF SXLLAQEQLE KALDCEIYGA 450 



HYSIEPLDLP QIIQRLHGLS AFSLHSYSPG EINRVASCLR KLGVPPLRVW 500 



RHRARSVRAX LLSQGGRAAT CGKYLFKHIVV RTKLKLTPIP AASQLDLSGM 550 



FVAGYSGGDl YHSLSRARPR WPMWOXLLS VGVGIYLLPN R 591 Clone 4; SEQ ID HO: 2 

Clone 14; SEQ ID NO: 8 

™ C Clone 21; SEQ ID NO: 9 

Clone 11; SEQ ID NO: 10 

Clone 16; SEQ ID NO: 11 

Clone 20; SEQ ID NO: 12 



Fig. IB 
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SMSTSWfCAL VTPCAAEEQK LPINALSNSL LRKHNLVYST TSRSACQRQK 50 AF009606 /• • 

. - WO 97/12033 . ' v.^ . ■ V * • 

■jj if I s M— A- S Clone 4; SEQ ID NO: 2. ' ■ ; * I \ / 



KVTFDRLQVL DSHYQDVLKE VtOUy^SKVUV NLLSVEBACS LTPPHSAKSK 100 
. D— R K— T— - — K * R— 

FGYGAKDVRC HARKAVAHZM SVHKDLLEDS VTPIDTTIMA KHEVFCVQPE 150 
LSS H T r 

KGGRKPARLI VFPDLGVRVC EKMALYDWS KLPLAVMGSS YGFQYSFGQR 200 



VEFLVQAWKS KKTPMGFSYD TRCFDSTVTB SDIRTEEAIY QCCDLDPQAR 250 
; IIII-H C— N---V— S— . A-E- 



• . i - 1 



« f \. ■» . ■ 



• • ; r M vaikslterl' yvggpltmsr gqiccyi«cr ascvuttscg ntltcyikar 300 . : • ; V ■ : ^V(.^> J- \ j Vrl M^^i'^y M^^^?^^^ 



AAOIAAGLQD CTMLVOGDDL WICESAGVQ EDAASLRAFT EAMTRYSAPP 350 
GDPPQPEYDL ELITSCSSNV SVAHDGAGKR VYYLTRDPTT PLARAAWBTA 400 



RHTPVNSWLG NIXMFAPTLW ARMILHTHFF SVI*IARDQLB QALNCEIYGA 450 
IIIIIIII Y I-L-Q K— D 

CTSIEPLDLP PIIQRLHGLS AFSLHSYSPG EINRVAACLR KLGVPPLRAW 500 

qI:::::::: :::::::::: iiiiiisi v. 

RHRARSVRAR LLSRGGRAAI CGKYLFNWAV RTKLKLTPIA AAGRLDLSGW 550 

A 0 

K Q T P— SO- 



FTAGYSOGDI YHSVSHARPR WPWFCLLLLA AGVGIYLLPN R 
L-R S V 



591 AF009606 , \ 

WO 97/12033 

Clone 4; SEQ ID KO: 2 /: 



Fig. 3 




-A- 



GDPPQPEYDL ELITSCSSNV SVAHDASGKR VYYLTRDPTT PLARAAWETA 400 



RHTPVNSWLG NUMYAPTLW ARMIItKTHFF SILLAQEQLE KALDCQIYGA 450 
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MSHSmflGAL ITPCAAEESK LPIMALSNSL UOWNMVYAT TSRSASQRQK 50 Clone 4; SEQ ID NO: 2 



AF054247 



KVrFDRLQVL DDHYRDVLKB HiaXASTVKA KLLSVEEACaC LTPPHSARSK 100 



FGTGAKDVRN LSSKAVNHIH SVHKDLLEDT ETPZDTTIMA KNBVFCVQPE ISO 

R R -S 

n : I III 

KXSGRXPARLI vm>LGVRVC EXMALYNWS TLFQAVMGSS YGFQfYSPGQR 200 
IV/A 

N^AnOwL KKCPHGFAYD TRCFDSTVTE NDIRVEESIY QCCDLAPEAR 250 



QAIRSLTERL YIGGPLTHSK GOSCGYRRCR ASGVLTTSCG NTLTCYLKAS 300 

— N T 

/ 

VI/C 

AACRAAKLQD CXMLVOGDDL WXCESAGTQ EDAASLRVFT EAHTRYSAPP 350 

VII/D VIII ^ 

GDPPQPEYDL CLITSCSSNV SVAHDASGXR VYYLTROPrr PLARAAHETA 400 



RHTPVNSWLG NIIMYAPTUW ARMILMTHFF SILLAQEQLE KALDCBIYGA 450 
I Q 



HYSIEPLDLP QIIQRLHGLS AFSLHSYSPG BINRVASCLR XZX^VPPLRVW 500 
C 0 T 

RHRAR5VRAK LLSQG6RAAT CGKYUFNWAV RTXLKLTPIP AASQLDLSGW 550 



FVAGYSGGDI YHSLSRARPR IfFMMCLLIXS VGVGIYLLPN R 
PL 



591 Clone 4; SEQ ID NO: 2 
AF054247 



Fig. 5 




AdOO 318V1IVAV 1S39 
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SEQ ID NO: 1 (Clone 4) 
ATGTCAATGTCCTACACATGGACAGGCGCTCTGATTACACCAT^^ 50 

GGAGGAGAGCAAGCTGCCCATCAATGCGCTGAGCAAC^^ 100 

ACCATAACATQGTCTATGCCAOUiaVlXXXX^ 150 

AAGAAGGTAACTTTTGACAGGCTGO^GTCCTG^ 200 

CXnXXrTCAAGGAGATGAAGGCGAAGGCGTCCACAGTCAAG^ 250 

TATCCCTQGAAGAAGCXTGTAAGCTGACGCCC^^ 300 

AAATTO5GCTATGGGGCGAAGGACGTCCGGAACCT 350 

TAACCACATCCACrcCGTGTGGAAGGACTTGCTGGJ^ 400 

CAATTGACACCACCATOVTGGCAAAAAATGMGTTT^^ 450 

GAGAAAGGAQGCCGCAAGCXyiGCTCG(XTAATCX^^ 500 

GGTTCGCGTCTXXXaiGAAAATQGCCCTCTACAAaSTGGTCT^^ 550 

CTCAGGCTGTGATGGGCTCCTCATACGGATTCCAGTACT^^ 600 

CXKXm^SAGTTCCTOGTGAATGCCTGGAAGTCAAAGA^ 650 

CnTOGCATATGACACCCGCTOTTTCGAC^^ 700 

TOZGTGTTGAGGAGTCAATTTACCAATGTTC^^ 750 

AGACAGGCXATAAGGTCGCITACAGAGCGGCTTTATATCGGGCX^^ 800 

GACTAACTOUIAAGGGCAGAGCTGCGGTTATCGCCGGTC^ 850 

GCGTACTGACXSACTAGCTGCGGTAATACCXrrcACATGTTAC^^ 900 
TercCMCCTGTCGAGCTCXyUUVGC^ 950 

CGGAGAOSACCTaSTCGTTATCTGTGAAAGCGCGGGAACC^ 1000 
CGGCGAGCCTACGAGTCTTCyvCGGAGGCTATGACrAGGTACTCTGCCCCC 1050 
CCriXXX3GACCCXXX:CC7UVCCAGAATACGACTTGGAGCTAATAACATC^^ 1100 . 

CTCCTCCMiCaiXST^ 1150 

ACXrrCACCCGTGACCCCACCACCCCnxrrrGCK 1200 

GCTAGACACACTCCAGTTAACTCCTCGCTAGGCAACATCATCATGTA 12 50 

GCCCACTCTGTGGGCAAGGATGATTCTGATGACT^ 1300 

TTCTAGCTCAGGAGCAACTTGAAAAAGCCCTAGATTGTG^ 1350 

GCa:ACTACKXATTGAGCCACTTGACCrACCT(^ 1400 

CCATQGTCTTAGCXXXnTrrTCACTCCACAGTTAC^^ 1450 

ATAGGGTGGCTTOITGCCTCAGGAAGCTTGQGGTACC^^ 1500 

TGGAGACATaXXSCCAGAAGOSTC^^ 1550 

GAGGGCTGCCACTTGTGGCAAGTACCrrCTT^ 1600 

AGCTTAAACTCACrcCAATTCCGGCTGCGTCCCAGTTC^ 1650 

TGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTC 1700 

TGCCOSACXCCGCTGGTTCATGTGGTGCCTACTC^ 1750 

TAGGCATCTACCTGCTCCXXSAACCGATGA 1779 



Fig. 7 
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SEQ ID NO; 4 (Clone 21) 

ATGTCAATGTCCTACACATGGACAGGCGCTCTGATO^ 50 

GGA<X3AGAGCAAGCTGCXXyVTCAATGCGCTGAGC^ 100 

accataacatggtctatgcx:acaacatcccgcagcgcaagc» 150 
/•aagaaggtaacttttgacaggctgcaagt^^ . 200 

CGOXXrrCAAGGAGATGAAGckxSAAGGCXSTCCAdlGT^ 250 

TATCCGTCGAAGAAGCCTGTAAGCTGACGCCCC^ 300 

AAATTTOGCTATGGGGCXSAAGGACCTCCGGAA^^ 350 

TAACCACATCCACTCCGTGTGGAAGGACTT^ 400. 

OUITTGACACXACCATCATGGCAAAAAATGAGGTT^^ 450 

GAGAAAGGAGGCCGCAAGCCAGCTCGCCTAATOT 500 

GGTTCXXX?PCn!GCGAGAAAATGGCCCTCTACX3AC^^ 550 

CTOVGGCTGTCATGGGCrarK^ 600 

CGGGTCGAGTTCCTCGTGi^TGCXr^^ 650 

CTTCGCATATGACACCCGCTGT^^ 700 . 

. AGACAGGCOVTAAGGTCGCTTACAGAGCGGCTTTATATCXXXX^^ 800 

GACTAACTCAAAAGGGCAGAGCTCCGGTCATCGCCGGTGCC^T^^ 850 

GCGTACTGACGACTAGCriXXXXSTAATACarrCACATGTT 900 

TCTQCAGCCTOTCGAGCTGaVAAGCTCCAGGACTG» 950 

CGGAGACGAarrCATCGTTATCTGTGAAAGCGCGGGAACa^ 1000 

OXKiXyiGCCrACGAGTCrTCACGGAGQCTATGACTAGGT^ 1050 
CCTGGGGACCCGCCCXAACCAGAATACGACITOGAGCTAATAACATCATC 1100 

CTCCrCCAACGTGTCGGTCGCGCACXSAT^ 1150 

ACCTCACCCXn^CCCCACCACCCCTCTTGCGCGGGCT^ 1200 

GCTAGACACACTCCAGTTAACTCCTGGCTAGGCAACATCATCATCT 1250 

GCCCACTCTGTOGGCAAGGATGATTCTGATC^ 1300 

TTCTAGCTCMGAGCAACTTGAAAAAGCXCTAGATT^^ 1350 

GCCCACTACTCCATTGAGCCACrrcACCTACCTCAG^ 1400 

OIATGGTCTTAGCGCGTTTTCACTCCACAG 1450 

ATAGGGTGGCTTOlTGCXrrCAGGAAGC^^ 1500 

TGGhGl^ChrCCQGGCChG^ 1550 

GAGGCXrrGCCACTTGTGGCAAGTACCTC^^ 1600 

AGCTTAAACTCACTCCAATTCCGGCTGCGTCCCAGTT^ 1650 

TGGTTOSTTGCnXSGTTACAGCGGGGGAGACATATACCACAG^^ 1700 

TGCCaSACCCTGCTGGTTCATGTGGTGCXTACTC 1750 

TAGGCAlCTrACCTGCTCCCCAACCGATGA 1779 



Fig. 9 
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SEQ ID NO: 6 (Clone 16) 

ATGTCAATGTCCTACACATGGAGAGGCGCTCTGATTAC^ . • 50 

GGAGGAGAGCAAGCTGCCCATCAATGCGCTGAGCAAC^^ ' 100 

ACCATAACATQGTCTATGCCACAACATCCOGCAGCGCAA^^ 150 • 

AAGAAGCn'AACTTTTGACAGGCTGCAAGTCCT^ 200 ; 

CGTGCTCAAGGAGATGAAGGCGAAGGCGTCCAC^^ 250 

tatccgtggaagaagcx:tgtaagctgacgcccccacattcagcc^ 300 • 

: AAATTTCGCTATGGGGCGAAGGACGTCXXS^ 

TAACCACATCCACTCCGTGTGGAAGGACTTGCK^^ \ , 400 a 
CAATTGACACCACCATCATGGCAAAAAATGAGGTCTT^^ 

GAGAAAGGAGGCCGCAAGCCAGCKXSCCTAATC^^ ; 

GGTTCGCGTGTGCGAGAAAATGGCCCTCTACGACGT^^ 550 

CTCAGGCKnX3ATXK«nX^^ 600 

: ; 'ottcgcatatgacac^^ X 700V 

TCCGTGTTGAGGAGTCAATTTACCAATGTTCnm v ' 750 

AGA(»GGCCATAAGGrrcGCTTACAGAGCGGCTTTATAT^ . 

■ GACTAACTCAAAAGGGCAGAGCTGCXXSTTATCGCCGGTG^^ 850 

GCXM'ACTGACGACTAGCTGCXXSTAATACXXrrcACATGTTAC^^ 900 

TCTGCAGCCTGTCGAGCTGCAAAGCTCCAGGACTGC^ 950 

CXSGACaVCGACTTCGTCGTTATCTGTGAAAGCGCGGGAAC^ 1000 

CGGCGAGCCTACGAGTCTTCACGG AGGCTATGACTAGGTACTCT^ 1050 
CCTGGGGAOXGCCCCAACCAGAATACGACTTGGAGCTAATAACATC ATG 1100 

CTCCTCCAACGTGTCGGTCGCGOVCGATGCATCTGGC^^ 1150 

ACCTCACCCGTGACXXrCACCACXCCTCTTGCGCGGGCTG^ 1200 

GCTAGACACACTCCAGTTAACTCCTGGCTAGGCAACATCATC AT^ 1250 

GCCCACTCTGTGGGCAAGGATGATTCTGATGACTCAC^^ 1300 

TTCTAGCTCAGGAGOUlCrTGAAAAAGCCXrrAGAT^^ 1350 

GCCCACTACTCOiTTGAGCCACTTGACCTACCTCAG^ 1400 

CCATGGTCTTAGCGCGTTTTCACrcCACAGTTAC^^ 1450 

ATAGGGTGGCTTCATGCCTCAGGAAGCTTGGGGTACCACX:e^^ 1500 

TGGAGACATXXSGGCCAGAAGCGTCCGOKrrAAGTTAC^^ 1550 

GAGGGCTGCCACTTGTGGCAAGTACCIXr^^ 1600 

AGCTTAAACTCACTCCAATTCCGGCTGCGTCCC^ 1650 

TGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCCTG^ 1700 

TGCCCGACCCCGCTGGTTCATXnXSGTGCCTAC^ 1750 

TAGGCyiTCTACCTGCTCCX:CAACXX3AT^^ 1779 

Fig. 11 
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SEQ ID IK>: 7 (Clone 20} 

, GTGTCAATGTCCTACACATGGACAGGCGCTCTGAT^^ 
' GGAGGAGAGCAAGCTGCCCATCAATGCGCTGAGC^ 
ACCATAACATCGTCTATGCCACAACATCCCGCAGCC^^ 
AAGAAGGTAACmTOACAGGCTCCAAGTC^^ . 
CGTGCTCAAGGAGATGAAGGCGAAGGdcnXXrA^ 
TATCXXTTOCSAAGAAGCCTGTAAGCTGACGC^^ 
AAATTTGGCTATGQQGOSAAGGACGTCCXS^ 
TAACOlCATCCACTCCGTGTGGAAGGACTTGCa^^ 
CAATTGACACXrACCATCATGGCAAAAAATGAGGTTTT^^ 
GAGAAAGGAGGCXXSCAAGCCAlCXntXSOT / 
GGTTCGCGTGTGCGAGAAAATGGCXXrrCTACGACGTGC^^ 
CTCAGGCTGTGATGGGCTCCTCATACGGATTCCAGTACTC^ 
CGGGTCGAGTTCCTGGTGAAOXXrCK^^ 
CTTCCfCATATGACACTCGC^ 
TC(:xn!CTTCAQGAGTCAATTTAC^^ 

. AGACAGGCCATAAGGTCGCTTACAGAGCGGCTTTATATCGGGGGTC 

GACTAACTCAAAAGGGCAGAGCTGCGGTTATCXXraSGTK ^ 

GCGTACTGACGACTAGCTGOOTAATACCCTCACATGOT 

TCTGCAGCCTGTCXSAGCTGCAAAGCrrCCAGGA 

CGGAGACGACCTKXnCGTTATCTGTGT^GCGCGGGAACCCAAGAG^ 
CXX^CGAGCCTACGAGTCTTCACGGAGGCTATGACTAGGTACTCTGCCC^ 
CCTGGGGACCOSCCCaU^CCAGAATACGACTTGGAG^ 
CrarrcCAACGTGTCGGTCGCGCACGATGC^^ 
ACCTCACCCGTGACCCCACCACCCCTCTTGCGCGGGCTGOnXSC^ 
GCTAGACACACTCCAGTTAACTCCTGGCTAGGCAACATCATCATGTAT^ 
GCCCACTClXnXSGGaU^GGATGATTCTGAT^ 
TTCPAGCTCAGGAGCAACTTGAAAAAGCCCTAGATTGTG^ 
GCCCACTACTCCATTGAGCCACTTGACCTACCTCAGATCATTC^ 
CXJ^TGCTCTTAGCGCXSTTTTCACTCCAOV^^ 
ATAGGGTGGCTTCATGCCTCAGGAAGCTTGGGGTACCACCC^^ 
: TCGAGACATCGGGCCAGAAGCGTCOXrGCT^ 

GAGGGCTGCCACTTGTGGCAAGTACCTCnTCAACTCG^^ 
AGCTTAAACTCACTCCAATTCCGGCTGCGCCCCAGTTGGACTT^^ 
TGGTTCGTTGCTGGTTACAGCGGGGGAGACATATATCACAGCC^ 
TGCCXXaiCX:CCGCTGGTTCMX5TGGTGCCTACTC^^ 
TAGGCATCTACXTTGCTCCCCAACCGATGA 



Fig. 12 
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SEQ ID NO: 13 (Clone 20(V-1M)) 
-1 M SMSYTVmSALITPCAAEESKLPINALSNSLLRHHNMVnfATO 50 

KVTFDRLQVLDDHYRDV^KEMKAKASTVmAKLLSVEEACKLTPPH^^ 100 

FGYGAKDVRNLSSKAViraiHSVWKDLLEIWOTPXIOT 150 

RCX5RKPARLIVYPDI/;vmrcEKMM*YDVVSTLPQAVl^ 200 

VEFLVNAWKSKKCPMGFAYDTRCFDSTVTErroiRVEESIYQCOTIJ^ 250 

QAIRSLTEIUjYIGGPLTNSKGQSCGYRRCRASGVLTTSCGNTLTCnjKM 300 

AACRAAKLQDCTMLVCXSDDLVVICESAGTQEDAASU^^ 350 

Ca>PPQPEYDLELITSCSSNVSVJan3ASGKR\nrn.TW)PTTPLAR^^ 400 

RHTPVNSWIXSOTIMYAPTLWARMIUmiFFSIUAQEQL^^ 450 

HYSIEPI^LPQIIQRIiHGLSAFSLHSySPGEIimVASCIJUaXS^^PL^^ 500 

WmARSVRAKLLSQGGRAATCGKyLFlWAVRTKIJa^ 550 

FVAGYSGGDI YHSLSRARPRWFMWCLLLLSVGVGI YLLPNR 591 



Fig. 13 
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SEQUENCE LISTING 

<110> Virophanna, Incorporated 

<120> Hepatitis C Virus NS5B Compositions and 
Methods of Use Thereof 



<130> HBPPOL 

<150> 60/080,509 
<151> 1998-04-02 

<150> 60/090,356 
<151> 1998-06-23 

<160> 53 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1779 
<212> DNA 

<213> Hepatitis C Virus 



<400> 1 

atgtcaatgt cctacacatg gacaggcgct ctgattacac catgcgctgc ggaggagagc 60 

aagctgccca tcaatgcgct gagcaactct ttgctgcgcc accataacat ggtctatgcc 120 

acaacatccc gcagcgcaag ccagcggcag aagaaggtaa cttttgacag gctgcaagtc 180 

ctggatgacc actaccggga cgtgctcaag gagatgaagg cgaaggcgtc cacagtcaag 240 

gctaaacttc tatccgtgga agaagcctgt aagctgacgc ccccacattc agccagatcc 300 

aaatttggct atggggcgaa ggacgtccgg aacctatcca gcaaggccgt taaccacatc 360 

cactccgtgt ggaaggactt gctggaagac actgagacac caattgacac caccatcatg 420 

gcaaaaaatg aggttttctg cgttcaacca gagaaaggag gccgcaagcc agctcgccta 480 

atcgtgtacc cagacttggg ggttcgcgtg tgcgagaaaa tggccctcta caacgtggtc 540 

tccacccttc ctcaggctgt gatgggctcc tcatacggat tccagtactc tcctggacag 600 

cgggtcgagt tcctggtgaa tgcctggaag tcaaagaaat gccctatggg cttcgcatat 660 

gacacccgct gtttcgactc aacggtcacc gagaatgaca tccgtgttga ggagtcaatt 720 

taccaatgtt gtgacttggc ccccgaagcc agacaggcca taaggtcgct tacagagcgg 780 

ctttatatcg ggggtcccct gactaactca aaagggcaga gctgcggtta tcgccggtgc 840 

cgtgcgagcg gcgtactgac gactagctgc ggtaataccc tcacatgtta cttgaaggcc 900 

tctgcagcct gtcgagctgc aaagctccag gactgcacga tgcttgtgtg cggagacgac 960 

ctcgtcgtta tctgtgaaag cgcgggaacc caagaggacg cggcgagcct acgagtcttc 1020 

acggaggcta tgactaggta ctctgccccc cctggggacc cgccccaacc agaatacgac 1080 

ttggagctaa taacatcatg ctcctccaac gtgtcggtcg cgcacgatgc atctggcaaa 1140 

agggtatact acctcacccg tgaccccacc acccctcttg cgcgggctgc gtgggagaca 1200 

gctagacaca ctccagttaa ctcctggcta ggcaacatca tcatgtatgc gcccactctg 1260 

tgggcaagga tgattctgat gactcacttc ttctccatcc ttctagctca ggagcaactt 1320 

gaaaaagccc tagattgtga gatctacggg gcccactact ccattgagcc acttgaccta 1380 

cctcagatca ttcaacgact ccatggtctt agcgcgtttt cactccacag ttactctcca 1440 

ggtgaaatca atagggtggc ttcatgcctc aggaagcttg gggtaccacc cttgcgagtc 1500 

tggagacatc gggccagaag cgtccgcgct aagttactgt cccagggggg gagggctgcc 1560 

acttgtggca agtacctctt caactgggca gtaaggacca agcttaaact cactccaatt. 1620 

ccggctgcgt cccagttgga cttatccggc tggttcgttg ctggttacag cgggggagac 1680 

atatatcaca gcctgtctcg tgcccgaccc cgctggttca tgtggtgcct actcctactt 1740 

tctgtagggg taggcatcta cctgctcccg aaccgatga 1779 



<210> 2 
<211> 592 
<212> PRT 
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<213> Hepatitis C Virus 
<400> 2 

Met Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu He Thr Pro Cys Ala 

15 10 15 

Ala Glu Glu Ser Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu 

20 25 30 

Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin 

35 40 45 

Arg Gin Lys Lys Val Thr. Phe Asp Arg I-eu Gin Val Leu Asp Asp His 

50 55 60 

Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Ala Ser Thr Val Lys 
65 70 75 80 

Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro. Pro His 

85 90 95 

Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu 

100 105 110 

Ser Ser Lys Ala Val Asn His He His Ser Val Trp Lys Asp Leu Leu 

115 120 125 

Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys Asn Glu 

130 135 140 

Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu 
145 150 155 160 

He Val Tyr Pro T^p Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu 

165 170 175 

Tyr Asn Val Val Ser Tbi Leu Pro Gin Ala Val Met Gly Ser Ser Tyr 

180 185 190 

Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala 

195 200 205 

Trp Lys Ser Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg Cys 

210 215 220 

Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu Glu Ser He 
225 230 235 240 

Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg Ser 

245 250 255 

Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly 

260 265 270 

Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr 

275 280 285 

Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys 

290 295 300 

Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp 
305 310 315 320 

Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala TQa Ser 

325 330 335 

Leu Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly 

340 345 350 

Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser 

355 360 365 

Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr 

370 375 380 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr 
385 390 395 400 

Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met Tyr 

405 410 415 

Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser 

420 425 430 

He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Glu He 

435 440 445 

Tyr Gly Ala His Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He He 
450 455 460 
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Gin Arg Leu His 
465 

61y Glu lie Asn 

Pro' Leu Arg Val 
500 

Leu Ser Gin Gly 

515*: 

Trp Ala Val Arg 

530 

Gin Leu Asp Leu 
545 

lie Tyr His Ser 

Leu Leu Leu Leu 
580 
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Gly Leu 
470 
Arg Val 
485 

Trp Arg 

Gly Arg 

Thr Lys 

Ser Gly 
550 
Leu Ser 
565 

Ser Val 



Ser Ala 

Ala Ser 

His Arg 

Ala Ala 
520 
Leu Lys 

535 

Trp Phe 
Arg Ala 
Gly Val 



3 / 20 
Phe Ser 

Cys Leu 
490 
Ala Arg 
505 

Thr Cys 
Leu Thr 

Val Ala 



Leu His 
475 

Arg Lys 

Ser Val 

Gly Lys 

'pro He 
540 
Gly Tyr 
555 

Arg Trp 



Arg Pro 
570 

Gly He Tyr Leu 
585 



Ser Tyr 

Leu Gly 

Arg Ala 
510 
Tyr Leu 
525 

Pro Ala 

Ser Gly 

Phe Met 

Leu Pro 
590 



Ser Pro 
480 
Val Pro 
495 

Lys Leu 

Phe Asn 

Ala Ser 

Gly Asp 
560 
Trp Cys 
575 

Asn Arg 



<210> 
<211> 
<212> 
<213> 



3 

1779 
DNA 

Hepatitis 



C Virus 



<400> 3 



atgtcaatgt 

aagctgccca 

acaacatccc 

ctggatgacc 

gctaaacttc 

aaatttggct 

cactccgtgt 

gcaaaaaatg 

atcgtgtacc 

tccacccttc 

cgggtcgagt 

gacacccgct 

taccaatgtt 

ctttatatcg 

cgtgcgagcg 

tctgcagcct 

ctcgtcgtta 

acggaggcta 

ttggagctaa 

agggtatact 

gctagacaca 

tgggcaagga 

gaaaaagccc 

cctcagatca 

ggtgaaatca 

tggagacatc 

acttgtggca 

ccggctgcgt 

atatatcaca 

tctgtagggg 



cctacacatg 

tcaatgcgct 

gcagcgcaag 

actaccggga 

tatccgtgga 

atggggcgaa 

ggaaggactt 

aggttttctg 

cagacttggg 

ctcaggctgt 

tcctggtgaa 

gtttcgactc 

gtgacttggc 

ggggtcccct 

gcgtactgac 

gtcgagctgc 

tctgtgaaag 

tgactaggta 

taacatcatg 

acctcacccg 

ctccagttaa 

tgattctgat 

tagattgtga 

ttcaacgact' 

atagggtggc 

gggccagaag 

agtacctctt 

cccagttgga 

gcctgtctcg 

taggcatcta 



gacaggcgct 

gagcaactct 

ccagcggcag 

cgtgctcaag 

agaagcctgt 

ggacgtccgg 

gctggaagac 

cgttcaacca 

ggttcgcgtg 

gatgggctcc 

tgcctggaag 

aacggtcacc 

ccccgaagcc 

gactaactca 

gactagctgc 

aaagctccag 

cgcgggaacc 

ctctgccccc 

ctcctccaac 

tgaccccacc 

ctcctggcta 

gactcacttc 

gatctacggg 

ccatggtctt 

ttcatgcctc 

cgtccgcgct 

caactgggca 

cttatccggc 

tgcccgaccc 

cctgctcccc 



ctgattacac 

ttgctgcgcc 

aagaaggtaa 

gagatgaagg 

aagctgacgc 

aacctatcca 

actgagacac 

gagaaaggag 

tgcgagaaaa 

tcatacggat 

tcaaagaaat 

gagaatgaca 

agacaggcca 

aaagggcaga 

ggtaataccc 

gactgcacga 

caagaggacg 

cctggggacc 

gtgtcggtcg 

acccctcttg 

ggcaacatca 

ttctccatcc 

gcccactact 

agcgcgtttt 

aggaagcttg 

aagttactgt 

gtaaggacca 

tggttcgttg 

cgctggttca 

aaccgatga 



catgcgctgc 

accataacat 

cttttgacag 

cgaaggtgtc 

ccccacattc 

gcaaggccgt 

caattgacac 

gccgcaagcc 

tggccctcta 

tccagtactc 

gccctatggg 

tccgtgttga 

taaggtcgct 

gctgcggtta 

tcacatgtta 

tgcttgtgtg 

cggcgagcct 

cgccccaacc 

cgcacgatgc 

cgcgggctgc 

tcatgtatgc 

ttctagctca 

ccattgagcc 

cactccacag 

gggtaccacc 

cccagggggg 

agcttaaact 

ctggttacag 

tgtggtgcct 



ggaggagagc 

ggtctatgcc 

gctgcaagtc 

cacagtcaag 

agccagatcc 

taaccacatc 

caccatcatg 

agctcgccta 

cgacgtggtc 

tcctggacag 

cttcgcatat 

ggagtcaatt 

tacagagcgg 

tcgccggtgc 

cttgaaggcc 

cggagacgac 

acgagtcttc 

agaatacgac 

atctggcaaa 

gtgggagaca 

gcccactctg 

ggagcaactt 

acttgaccta 

ttactctcca 

cttgcgagtc 

gagggctgcc 

cactccaatt 

cgggggagac 

actcctactt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1779 



<210> 4 
<211> 1779 
<212> DNA 

<213> Hepatitis C Virus 



<400> 4 

atgtcaatgt cctacacatg gacaggcgct ctgattacac catgcgctgc ggaggagagc 



wo 99/51781 



PCTAJS99/a7404 



aagctgccca 

acaacatccc 

ctggatgacc 

gctaaacttc 

aaatttggct 

cactccgtgt 

gcaaaaaatg 

atcgtgtacc 

tccacccttc 

cgggtcgagt 

gacacccgct 

taccaatgtt 

ctttatatcg 

cgtgcgagcg 

tctgcagcct 

ctcatcgtta 

acggaggcta 

ttggagctaa 

agggtatact 

gctagacaca 

tgggcaagga 

gaaaaagccc 

cctcagatca 

ggtgaaatca 

tggagacatc 

acttgtggca 

ccggctgcgt 

atataccaca 

tctgtagggg 



tcaatgcgct 

gcagcgcaag 

actaccggga 

tatccgtgga 

atggggcgaa 

ggaaggactt 

aggttttctg 

cagacttggg 

ctcaggctgt 

tcctggtgaa 

gtttcgactc 

gtgacttggc 

ggggtcccct 

gcgtactgac 

gtcgagctgc 

tctgtgaaag 

tgactaggta 

taacatcatg 

acctcacccg 

ctccagttaa 

tgattctgat 

tagattgtga 

ttcaacgact 

atagggtggc 

gggccagaag 

agtacctctt 

cccagttgga 

gcctgtctcg 

taggcatcta 



gagcaactct 
ccagcggcag 
cgtgctcaag 
agaagcctgt 
ggacgtccgg 
gctggaagac 
cgttcaacca 
ggttcgcgtg 
gatgggctcc 
tgcctggaag 
gacggtcacc 
ccccgaagcc 
gactaactca 
gactagctgc 
aaagctccag 
cgcgggaacc 
ctctgccccc 
ctcctccaac 



tgaccccacc 
ctcctggcta 
gactcacttc 
gatctacggg 
ccatggtctt 
ttcatgcctc 
cgtccgcgct 
caactgggca 
cttatccggc 
tgcccgaccc 
cctgctcccc 



4/20 
ttgctgcgcc 
aagaaggtaa 
gagatgaagg 
aagctgacgc 
aacctatcca 
actgagacac 
gagaaaggag 
tgcgagaaaa 
tcatacggat 
tcaaagaaat 
gagaatgaca 
agacaggcca 
aaagggcaga 
ggtaataccc 
gactgcacga 
caagaggacg 
cctggggacc 
gtgtcggtcg 
acccctcttg 
ggcaacatca 
ttctccatcc 
gcccactact 
agcgcgtttt 
aggaagcttg 
aagttactgt 
gtaaggacca 
tggttcgttg 
tgctggttca 
aaccgatga 



accataacat 

cttttgacag 

cgaaggcgtc 

ccccacatt-c 

gcaaggccgt 

caattgacac 

gccgcaagcc 

tggccctcta 

tccagtactc 

gccctatggg 

tccgtgttga 

taaggtcgct 

gctgcggtca 

tcacatgtta 

tgcttgtgtg 

cggcgagcct 

cgccccaacc 

cgcacgatgc 

cgcgggctgc 

tcatgtatgc, 

ttctagctca 

ccattgagcc 

cactccacag 

gggtaccacc 

cccagggggg 

agcttaaact 

ctggttacag 

tgtggtgcct 



ggtctatgcc - 

gctgcaagtc 

cacagtcaag 

agccagatcc 

taaccacatc 

caccatcatg 

agctcgccta 

cgacgtgatc. 

tcctggacag' 

cttcgcatat ' 

ggagtcaatt 

tacagagcgg 

tcgccggtgcV' 

cttgaaggcc / 

cggagacgac 

acgagtcttc ' 

agaatacgac 

atctggcaaa 

gtgggagaca 

igcccactctg 

ggagcaactt^'y 

acttgaccta'V 

ttactctcca 

cttgcgagtc 

gagggctgcc 

cactccaatt 

cgggggagac 

actcctactt 



. 120 

180 

240 

300 

360 

420 

480 
.540 

600 

660 

720 

.780 

840 

900 

960 
1020 
1080 
1140 , 
1200 •? 
1260 :V. 
;.132q;\\ 
::i380i:-V 
■1440 j;' 

1500 

1560 \ 
1620 M 
1680 
1740 
1779 



<210> 5 
<211> 1779 
<212> DNA 

<213> Hepatitis C Virus 



<400> 5 



atgtcaatgt 

aagctgccca 

acaacatccc 

ctggatgacc 

gctaaacttc 

aaatttggct 

cactccgtgt 

gcaaaaaatg 

atcgtgtacc 

tccacccttc 

cgggtcgagt 

gacacccgct 

taccaatgtt 

ctttatatcg 

cgtgcgagcg 

tctgcagcct 

ctcgtcgtta 

acggaggcta 

ttggagctaa 

agggtatact 

gctagacaca 

tgggcaagga 

gaaaaagccc 

cctcagatca 

ggtgaaatca 



cctacacatg 

tcaatgcgct 

gcagcgcaag 

actaccggga 

tatccgtgga 

atggggcgaa 

ggaaggactt 

aggttttctg 

cagacttggg 

ctcaggctgt 

tcctggtgaa' 

gtttcgactc 

gtgacttggc 

ggggtcccct 

gcgtactgac 

gtcgagctgc 

tctgtgaaag 

tgactaggta 

taacatcatg 

acctcacccg 

ctccagttaa 

tgattctgat 

tagattgtga 

ttcaacgact 

atagggtggc 



gacaggcgct 

gagcaactct 

ccagcggcag 

cgtgctcaag 

agaagcctgt 

ggacgtccgg 

gctggaagac 

cgttcaacca 

ggttcgcgtg 

gatgggctcc 

tgcctggaag 

aacggtcacc 

ccccgaagcc 

gactaactca 

gactagctgc 

aaagctccag 

cgcgggaacc 

ctctgccccc 

ctcctccaac 

tgaccccacc 

ctcctggcta 

gactcacttc 

gatctacggg 

ccatggtctt 

ttcatgcctc 



ctgattacac 
ttgctgcgcc 
aagaaggtaa 
gagatgaagg 
aagctgacgc 
aacctatcca 
actgagacac 
gagaaaggag 
tgcgagaaaa 
tcatacggat 
ccaaagaaat 
gagaatgaca 
agacaggcca 
aaagggcaga 
ggtaataccc 
gactgcacga 
caagaggacg 
cctggggacc 
gtgtcggtcg 
acccctcttg 
ggcaacacca 
ttctccatcc 
gcccactact 
agcgcgtttt 
aggaagcttg 



catgcgctgc 

accataacat 

cttttgacag 

cgaaggcgtc 

ccccacattc 

gcaaggccgt 

caattgacac 

gccgcaagcc 

tggccctcta 

tccagtactc 

gccctatggg 

tccgtgttga 

taaggtcgct 

gctgcggtta 

tcacatgtta 

tgcttgtgtg 

cggcgagcct 

cgccccaacc 

cgcacgatgc 

cgcgggctgc 

tcatgtatgc 

ttctagctca 

ccattgagcc 

cactccacag 

gggtaccacc 



ggaggagagc 

ggtctatgcc 

gctgcaagtc 

cacagtcaag 

agccagatcc 

taaccacatc 

caccatcatg 

agctcgccta 

cgacgtggtc 

ccctggacag 

cttcgcatat 

ggagtcaatt 

tacagagcgg 

tcgccggtgc 

cttgaaggcc 

cggagacgac 

acgagtcttc 

agaatacgac 

atctggcaaa 

gtgggagaca 

gcccactctg 

ggagcaactt 

acttgaccta 

ttactctcca 

cttgcgagtc 



60 

120 

180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140^ 
.1200 
1260 
1320 
1380 
1440 
1500 



wo 99/51781 



PCTAJS99/07404 



5 / 20 



tggagacatc gggccagaag cgtccgcgct aagttactgt cccagggggg gagggctgcc 
acttgtggca agtacctctt caactgggca gtaaggacca agcttaaact cactccaatt 
ccggctgcgt cccagttgga cttatccggc tggttcgttg ctggttacag cgggggagac 
atatatcaca gcctgtctcg tgcccgaccc cgctggttca tgtggtgcct actcctactt 
tctgtagggg taggcatcta cctgctcccc aaccgatga 

<210> 6 

<2ii> 1779 :]'\. ■ r ' ' ;■ " 

<212> DNA "'V' 
<213> Hepatitis C Virus V 

<4oo> 6 • . ^ •-• ' ■ 

atgtcaatgt cctacacatg gagaggcgct ctgattacac . catgcgctgc ggaggagagc 
aagctgccca tcaatgcgct gagcaactct ttgctgcgcc accataacat ggtctatgcc 
acaacatccc gcagcgcaag ccagcggcag aagaaggtaa cttttgacag gctgcaagtc 
ctggatgacc actaccggga cgtgctcaag gagatgaagg cgaaggcgtc cacagtcaag 
gccaaacttc tatccgtgga agaagcctgt aagctgacgc ccccacattc agccagatcc 
aaatttggct atggggcgaa ggacgtccgg aacctatcca gcaaggccgt taaccacatc 
cactccgtgt ggaaggactt gctggaagac actgagacac caattgacac caccatcatg 
gcaaaaaatg aggtcttctg cgttcaacca gagaaaggag gccgcaagcc agctcgccta 
atcgtgtacc cagacttggg ggttcgcgtg tgcgagaaaa tggccc^cta cgacgtggtc 
tccacccttc ctcaggctgt gatgggctcc tcatacggat tccagcactc tcctggacag 
cgggtcgagt tcctggtgaa tgcctggaag tcaaagaaat gccctatggg cttcgcatat 
gacacccgct gtttcgactc aacggtcacc gagaatgaca tccgtgttga ggagtcaatt 
taccaatgtt gtgacttggc ccccgaagcc agacaggcca taaggtcgct tacagagcgg 
ctttatatcg ggggtcccct gactaactca aaagggcaga gctgcggtta tcgccggtgc 
cgtgcgagcg gcgtactgac gactagctgc ggtaataccc tcacatgtta cttgaaggcc 
tctgcagcct gtcgagctgc aaagctccag gactgcacga tgcttgtgtg cggagacgac 
ttcgtcgtta tctgtgaaag cgcgggaacc caagaggacg cggcgagcct acgagtcttc 
acggaggcta tgactaggta ctctgccccc cctggggacc cgccccaacc agaatacgac 
ttggagctaa taacatcatg ctcctccaac gtgtcggtcg cgcacgatgc atctggcaaa 
agggtatact acctcacccg tgaccccacc acccctcttg cgcgggctgc gtgggagaca 
gctagacaca ctccagttaa ctcctggcta ggcaacatca tcatgtatgc gcccactctg 
tgggcaagga tgattctgat gactcacttc ttctccatcc ttctagctca ggagcaactt 
gaaaaagccc tagattgtga gatctacggg gcccactact ccattgagcc acttgaccta 
cctcagatca ttcaacgact ccatggtctt agcgcgtttt cactccacag ttactctcca 
ggtgaaatca atagggtggc ttcatgcctc aggaagcttg gggtaccacc cttgcgagtc 
tggagacatc gggccagaag cgtccgcgct aagttactgt cccagggggg gagggctgcc 
acttgtggca agtacctctt caactgggca gtaaggacca agcttaaact cactccaatt 
ccggctgcgt cccagttgga cttatccggc tggttcgttg ctggttacag cgggggagac 
atatatcaca gcctgtctcg tgcccgaccc cgctggttca tgtggtgcct actcctactt 
tctgtagggg taggcatcta cctgctcccc aaccgatga 



1560 
1620 
1680 
1740 
1779 



: 60 
120 
180 
240 
300 
360 
'420 
480 

540 

>; 600*' 

660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1779 



<210> 7 
<211> 1779 
<212> DNA 

<213> Hepatitis C Virus 



<400> 
gtgtcaatgt 
aagctgccca 
acaacatccc 
ctggatgacc 
gctaaacttc 
aaatttggct 
cactccgtgt 
gcaaaaaatg 
atcgtgtacc 
tccacccttc 
cgggtcgagt 
gacactcgct 



cctacacatg 
tcaatgcgct 
gcagcgcaag 
actaccggga 
tatccgtgga 
atggggcgaa 
ggaaggactt 
aggttttctg 
cagacttggg 
ctcaggctgt 
tcctggtgaa 
gtttcgactc 



gacaggcgct 
gagcaactct 
ccagcggcag 
cgtgctcaag 
agaagcctgt 
ggacgtccgg 
gctggaagac 
cgttcaacca 
ggttcgcgtg 
gatgggctcc 
tgcctggaag 
aacggtcacc 



ctgattacac 
ttgctgcgcc 
aagaaggtaa 
gagatgaagg 
aagctgacgc 
aacctatcca 
actgagacac 
gagaaaggag 
tgcgagaaaa 
tcatacggat 
tcaaagaaat 
gagaatc^aca 



catgcgctgc 
accataacat 
cttttgacag 
cgaaggcgtc 
ccccacattc 
gcaaggccgt 
caattgacac 
gccgcaagcc 
tggccctcta 
tccagtactc 
gccctatggg 
tccgtgttga 



ggaggagagc 
ggtctatgcc 
gctgcaagtc 
cacagtcaag 
agccagatcc 
taaccacatc 
caccatcatg 
agctcgccta 
cgacgtggtc 
tcctggacag 
cttcgcatat 
ggagtcaatt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



wo 99/51781 PCTAJS99/07404 
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taccaatgtt gtgacttggc ccccgaagcc agacaggcca taaggtcgct tacagagcgg 780 

ctttatatcg ggggtcccct gactaactca aaagggcaga gctgcggtta tcgccggtgc 840 

cgtgcgagcg gcgtactgac gactagctgc ggtaataccc tcacatgtta cttgaaggcc 900 

tctgcagcct gtcgagctgc aaagctccag gactgcacga tgcttgtgtg cggagacgac 960 

ctcgtcgtta tctgtgaaag cgcgggaacc caagaggacg cggcgagcct acgagtcttc 1020 

acggaggcta tgactaggta ctctgccccc cctggggacc cgccccaacc agaatacgac 1080 

ttggagctaa taacatcatg ctcctccaac gtgtcggtcg cgcacgatgc atctggcaaa 1140 

/agggtatact acctcacccg tgaccccacc acccctcttg cgcgggctgc gtgggagaca 1200 

gctagacaca ctccagttaa ctcctggcta ggcaacatca tcatgtatgc gcccactctg 1260 

tgggcaagga tgattctgat gactcacttc ttctccatcc ttctagctca ggagcaactt 1320 

gaaaaagccc tagattgtga gatctacggg gcccactact ccattgagcc acttgaccta 1380 

cctcagatca ttcaacgact ccatggtctt agcgcgtttt cactccacag ttactctcca 1440 

'- ggtgaaatca atagggtggc ttcatgcctc aggaagcttg gggtaccacc cttgcgagtc 1500 

tggagacatc gggccagaag cgtccgcgct aagttactgt cccagggggg gagggctgcc 1560 

acttgtggca agtacctctt caactgggca gtaaggacca agcttaaact^ cactccaatt 1620 

ccggctgcgc cccagttgga cttatccggc tggttcgttg ctggttacag' cgggggagac 1680 

atatatcaca gcctgtctcg tgcccgaccc cgctggttca tgtggtgcct actcctactt 1740 

tctgtagggg taggcatcta cctgctcccc aaccgatga 1779 

<210> 8 
<211> 592 
<212> PRT 

<213> Hepatitis C Virus 
<400> 8 

Met Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu lie Thr Pro Cys Ala 

1 5 10 15 

Ala Glu Glu Ser Lys Leu Pro lie Asn Ala Leu Ser Asn Ser Leu Leu 

20 25 30 

Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin 

35 40 45 

Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp His 

50 55 60 

Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Val Ser Thr Val Lys 
65 70 75 80 

Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His 

85 90 95 

Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu 

100 105 110 

Ser Ser Lys Ala Val Asn His lie His Ser Val Trp Lys Asp Leu Leu 

115 120 125 

Glu Asp Thr Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn Glu 

130 135 140 

Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu 
145 150 155 160 

lie Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu 

165 170 175 

Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser Tyr 

180 185 190 

Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala 

195 200 205 

Trp Lys Ser Lys Lys Cys Pro Met Gly Phe Ala Tyx Asp Thr Arg Cys 

210 215 220 

Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu Glu Ser He 
225 230 235 240 

Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg Ser 

245 250 255 

Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly 

260 265 270 

Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr 
275 280 285 



wo 99/51781 P€rmiS99/07404 
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Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys 

290 295 300 

Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp 
305 310 315 320 

Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser 

325 330 335 

Leu Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly 

340 345 350 

Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser 

355 360 365 

Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr 

370 375 380 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr 
385 390 395 400 

Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met Tyr 

405 410 415 

Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser 

420 425 430 

He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Glu He 

435 440 445 

Tyr Gly Ala His Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He He 

450 455 460 

Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro 
465 470 475 480 

Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val Pro 

485 490 495 

Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Lys Leu 

500 505 510 

Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn 

515 520 525 

Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala Ser 

530 535 540 

Gin Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp 
545 550 555 560 

He Tyr His Ser Leu Ser Arg Ala Arg Pro Keg Trp Phe Met Trp Cys 

565 570 575 

Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn Arg 
580 585 590 

<210> 9 
<211> 592 
<212> PRT 

<213> Hepatitis C Virus 

<400> 9 

Met Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu He Thr Pro Cys Ala 

1 5 ■ 10 15 

Ala Glu Glu Ser Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu 

20 25 30 

Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin 

35 40 45 

Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp His 

50 55 60 

Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Val Ser Thr Val Lys 
65 70 75 80 

Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His 

85 90 95 

Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu 

100 105 110 

Ser Ser Lys Ala Val Asn His He His Ser Val Trp Lys Asp Leu Leu 
115 120 125 



wo 99/51781 PCTAJS99/07404 
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Glu Asp Thr Glu Thr Pro lie Asp Thr Thr lie Met Ala Lys Asn Glu 

130 135 140 

Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu 
145 150 155 160 

lie Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu 

165 170 175 

Tyr Asp Val He Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser Tyr 

180 185 190 

Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala 

195 200 205 

Trp Lys Ser Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg Cys 

210 215 220 

Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu Glu Ser He 
225 230 235 240 

Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg Ser 

245 250 255 

Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly 

260 265 270 

Gin Ser Cys Gly His Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr 

275 280 285 

Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys 

290 295 300 

Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp 
305 310 315 320 

Leu He Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser - 

325 330 335 

Leu Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly 

340 345 350 

Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser 

355 360 365 

Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr 

370 375 380 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr 
385 390 395 400 

Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met Tyr 

405 410 415 

Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser 

420 425 430 

He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Glu He 

435 440 445 

Tyr Gly Ala His Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He He 

450 455 460 

Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro 
465 470 475 480 

Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val Pro 

485 490 495 

Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Lys Leu 

500 505 510 

Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn 

515 520 525 

Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala Ser 

530 535 540 

Gin Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp 
545 550 555 560 

He Tyr His Ser Leu Ser Arg Ala Arg Pro Cys Trp Phe Met Trp Cys 

565 570 575 

Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr I»eu Leu Pro Asn Arg 
560 585 590 



<210> 10 
<211> 592 
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<212> PRT 

<213> Hepatitis C Virus 
<400> 10 

Met Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu He Thr Pro Cys Ala 

1 5 10 15 

Ala Glu Glu Ser Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu 

20 25 : . 30 

Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin 

35 .40 45 

Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp His 

50 '55 60 : 

Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Val Ser Thr Val Lys 
65 70 75 80 

Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His 

85 90 . 95 

Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu 

100 105 110 

Ser Ser Lys Ala Val Asn His He His Ser Val Trp Lys Asp Leu Leu 

115 120 .125 ■ . 

Glu Asp Thr Glu Thr Pro He Asp Thr Thr. He Met Ala Lys Asn Glu V 
:v • 130 . • • • .•:.135 : \,, ^^ 140^ '\' '- \'\^ . 

Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys' Pro Ala Arg Leu 
: 145 150 155 160 

He Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu 

165 170 175 

Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser Tyr 

180 185 190 

Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala 

195 200 205 

Trp Lys Pro Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg Cys 

210 215 220 

Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu Glu Ser He 
225 230 235 240 

Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg Ser 

245 250 255 

Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly 

260 265 270 

Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr 

275 280 285 

Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys 

290 295 300 

Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp 
305 310 315 320 

Leu Val Val He Cys Glu Ser T^a Gly Thr Gin Glu Asp Ala Ala Ser 

325 330 335 

Leu Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly 

340 345 350 

Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser 

355 360 365 

Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr 

370 375 380 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr 
385 390 395 400 

TQa Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn Thr He Met Tyr 

405 410 415 

Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser 

420 425 430 

He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Glu He 

435 440 445 

Tyr Gly Ala His Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He He 
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450 455 460 

Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro' 
465 470 475 480 

Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val Pro 

485 490 495 

Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Lys Leu 

500 505 510 

Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn 

515 520 \. 525. ),'/ ' 

Trp Ala Val Arg Thr Lya Leu Lys Leu Thr Pro He Pro Ala Ala Ser 

530 535 540 ? 

Gin Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp 
545 •550 555" 560 

He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met' Trp Cys ^ 
565 570 •\575 

, Leu Leu Leu Leu Ser Val Gly Val Gly lie Tyr Leu lieu Pro. Asn Arg 
580 585 590 

<2io> 11 * ' ■ ' V :^Cr 

<211> 592 ^v,;.' .. •* . -.•'.v' . ' y '^V" 

V ■ ■ . <212> .PRT. ^JX\( ■ ' V .V'"' ' ■ '-.t-- ^ V-i^/h 
; . So- <213> Hepatitis c'Virus . -^S'/ 



<400> 11 ■ ■ V/-' 

Met Ser Met Ser Tyr Thr Arg Thr Gly Ala Leu He Thr Pro Cys Ala 

1 5 10 . 15 

Ala Glu Glu Ser Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu 

20 25 30 

Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin 
35 40 45 - 

Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp His 

50 55 . 60 

Tyr Arg Asp Val Lfeu Lys Glu Met Lys Ala Lys Val Ser Thr Val Lys 
65 70 75 80 

Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His 

85 90 95 

Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu 

100 105 110 

Ser Ser Lys Ala Val Asn His He His Ser Val Trp Lys Asp Leu Leu 

115 120 125 

Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys Asn Glu 

130 135 140 

Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu 
145 150 155 160 

He Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu 

165 170 175 

Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser Tyr 

IBO 185 190 

Gly Phe Gin His Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala 

195 200 205 

Trp Lys Ser Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg Cys 

210 =215 220 • 

Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val. Glu Glu Ser He 
225 230 235 / '.v. 240 

Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg Ser 

. 245 250 255 

Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly 

260 265 270 

Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr 

275 280 285 

Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys 



) Leu Asp Leu . Pro Gin , He . lie 
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290 295 \ \r- 200[ . 

Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp 
305 310 315 320 

Phe Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala. Ala Sef • - 

325 330' ; ^ 

Leu Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly 

340 .345 350^ , / , • . /. 

Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser 

355 * . : *360 r-"-: • V;^' : --^ss . \.,^': . ' : ••■>V:;:\; 

Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr 

\370 375 K 380 ':. ■ '* ; V^V* 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr . V .- v . 
385 390 -395 v^JV.. -^.H^^^V;;(:40q.=; .-;v^r;^ 

Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met Tyr . Yo/' 

405 . 410 "... .--.i' 

Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr! His Phe^*. Phe Ser . .;Ay •/ 

420 425 430 ' 

He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Glu He 

435 440 . 445 

Tyr Gly Ala His Tyr Ser He . Glu Pro 

.•;^450 . .■' • ■-■455 

Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser TyrVS^^^ 
'465-: ' ^470 ' ■ ' / 475 • -'^ ' : 480 

Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys; Leu Gly Val Pro 
485 490,; . 

Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Lys Leu 

500 505 510 

Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn 

515 520 525 . 

Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala Ser 

530 535 540 

Gin Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp 
545 550 555 560 

He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp Cys 

565 570 575 

Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn Arg 
580 585 590 

<210> 12 
<211> 592 
<212> PRT 

<213> Hepatitis C Virus 
<400> 12 

Val Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu He Thr Pro Cys Ala 

1 5 10 V 15 

Ala Glu Glu Ser Lys ijeu Pro He Asn Ala Leu Ser Asn Ser Leu Leu 

20 25 30 

Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin 

35 40 ' ■- ' * • V 

Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Vai Leu Asp Asp His 

50 55 . 60 

Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Val Ser Thr Val Lys 
65 70 . : 75 . \ • . .80 

Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His 

85 90 . • 95 ' 

Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu 

100 105 ('■'"' 110 

Ser Ser Lys Ala Val Asn His He His Ser Val Trp Lys Asp Leu Leu 

115 120 . 125 . 

Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys Asn Glu 
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130 135 140 

Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu 
145 150 155 160 

He Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu 

165 170 175 

Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser Tyr 

180 185 190 

Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala 

195 200 205 

Trp Lys Ser Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg Cys 

210 215 220 

Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu Glu Ser He 
225 230 235 240 

Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg Ser 

245 250 , 255 

Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly 

260 265 270 

Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr 

275 280 285 

Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys 

290 295 300 

Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp 
305 310 315 320 

Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser 

325 330 335 

Leu Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly 

340 345 350 

Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser 

355 360 365 

Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr 

370 375 380 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr 
385 390 395 400 

Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met Tyr 

405 410 415 

Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser 

420 425 430 

He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Glu He 

435 440 445 

Tyr Gly Ala His Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He He 

450 455 460 

Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro 
465 470 475 480 

Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val Pro 

485 490 495 

Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Lys Leu 

500 505 510 

Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn 

515 520 525 

Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala Pro 

530 535 540 

Gin Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp 
545 550 555 560 

He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp Cys 

565 570 575 

Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn Arg 
580 585 590 



<210> 13 
<211> 592 
<212> PRT ♦ 
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<213> Hepatitis C Virus 
<400> 13 

Met Ser Met Ser Tyr Thr Trp Thr Gly Ala Leu He Thr Pro Cys Ala 

15 10 15 

Ala Glu Glu Ser Lys Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu 

20 25 30 

Arg His His Asn Met Val Tyr Ala Thr Thr Ser Arg Ser Ala Ser Gin 

35 40 45 

Arg Gin Lys Lys Val Thr Phe Asp Arg Leu Gin Val Leu Asp Asp His 

50 55 60 

Tyr Arg Asp Val Leu Lys Glu Met Lys Ala Lys Val Ser Thr Val Lys 
65 70 75 80 

Ala Lys Leu Leu Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His 

85 90 95 

Ser Ala Arg Ser Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn lUeu 

100 105 110 

Ser Ser Lys Ala Val Asn His He His Ser Val Trp Lys Asp Leu Leu 

115 120 125 

Glu Asp Thr Glu Thr Pro He Asp Thr Thr He Met Ala Lys Asn Glu 

130 135 140 

Val Phe Cys Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu 
145 150 155 160 

He Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu 

165 170 175 

Tyr Asp Val Val Ser Thr Leu Pro Gin Ala Val Met Gly Ser Ser Tyr 

180 185 190 

Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu Val Asn Ala 

195 200 205 

Trp Lys Ser Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp Thr Arg Cys 

210 215 220 

Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu Glu Ser He 
225 230 235 240 

Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala He Arg Ser 

245 250 255 

Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn Ser Lys Gly 

260 265 270 

Gin Ser . Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr 

275 280 285 

Ser Cys Gly Asn Thr Leu Thr Cys Tyr Leu Lys Ala Ser Ala Ala Cys 

290 295 300 

Arg Ala Ala Lys Leu Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp 
305 310 315 320 

Leu Val Val He Cys Glu Ser Ala Gly Thr Gin Glu Asp Ala Ala Ser 

325 330 335 

Leu Arg Val Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly 

340 345 350 

Asp Pro Pro Gin Pro Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser 

355 360 365 

Ser Asn Val Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr 

370 375 380 

Leu Thr Arg Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr 
385 390 395 400 

Ala Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn He He Met Tyr 

405 410 415 

Ala Pro Thr Leu Trp Ala Arg Met He Leu Met Thr His Phe Phe Ser 

420 425 430 

He Leu Leu Ala Gin Glu Gin Leu Glu Lys Ala Leu Asp Cys Glu He 

435 440 445 

Tyr Gly Ala His Tyr Ser He Glu Pro Leu Asp Leu Pro Gin He He 
450 455 460 
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Gin Arg Leu His Gly Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro 
465 470 475 480 

Gly Glu He Asn Arg Val Ala Ser Cys Leu Arg Lys Leu Gly Val Pro 

485 490 495 

Pro Leu Arg Val Trp Arg His Arg Ala Arg Ser Val Arg Ala Lys Leu 

500 505 510 

Leu Ser Gin Gly Gly Arg Ala Ala Thr Cys Gly Lys Tyr Leu Phe Asn 

515 520 525 

Trp Ala Val Arg Thr Lys Leu Lys Leu Thr Pro He Pro Ala Ala Pro 

530 535 540 

Gin Leu Asp Leu Ser Gly Trp Phe Val Ala Gly Tyr Ser Gly Gly Asp 
545 550 555 560 

He Tyr His Ser Leu Ser Arg Ala Arg Pro Arg Trp Phe Met Trp Cys 

565 570 575 

Leu Leu Leu Leu Ser Val Gly Val Gly He Tyr Leu Leu Pro Asn Arg 
580 585 590 

<210> 14 
<211> 20 
<212> DNA 

<213> Hepatitis C Virus 
<400> 14 

tcaatgtcct acacatggac 20 

<210> 15 
<211> 10 
<212> DNA 

<213> Hepatitis C Virus 

<400> 15 
ctacacatgg 

<210> 16 
<211> 41 
<212> DNA 

<213> Hepatitis C Virus 
<400> 16 

ctctgattac accatgcgct gcggaggaga gcaagctgcc c 41 

<210> 17 
<211> 28 
<212> DHA 

<213> Hepatitis C Virus 
<400> 17 

aatgcgctga gcaactcttt gctgcgcc 28 

<210> 18 
<211> 47 
<212> DNA 

<213> Hepatitis C Virus 
<400> 18 

ccataacatg gtctatgcca caacatcccg cagcgcaagc cagcggc 47 

<210> 19 
<211> 22 
<212> DNA 
' <213> Hepatitis C Virus 



. . W099/S1781 : - ;■- - PCI7US99rt)7404 

•/ <4oo> 19 !'[:'■: ' / .- ^ '^V 

gaagaaggta acttttgaca gg . !• 

^ * * <2io> 20 ; ' ' -y- . ' / ■ • ■ " 

<2ii> 23 ' . - :\ ... • " ■ * ' ' ' • 

<212> DNA 

<213> Hepatitis C Virus . ■/; /• 

. . caagtcctgg atgaccacta'ccg .\ " ■[ V • * ^//V^ 

<2io> 21 ^;;^v^.= ' ^ " Y / ' V/ " 

<2i2> mui " '^V ' -'i \ ■ • ' 

<213> Hepatitis C Virus . . 

<4oo> 21 • ■ • ■/ ■ ' 

gacgtgctca agga ;/-.. . ' . ^ ' - • * '^r' 

^' ;^^'V::y •<2ii> la ^: . /:^M^/^^^^/;V.;^C2^^ . i^':'^.^ ^ v^^■•r^i#^-;^:- 

:•^yv:;■•';;^v;;^^:■:v': <2i2> DNA ; ' ^:v■•^n*•■^^;;*. "V^'V c ;v}*v:^' ^ " j : ' ;^s'^'-vVv;/;-: • :^;.^;r^H^; -1^^^^^^ 

<213> Hepatitis C Virus - • ■ ' • ' ' . ' ' ' • ^?.'/'-<.,- ' v'//;'-/' 

• ;)rv;;.;/*; ' • * ^ . . .. * "Miv;-:/'* . ■; V 

• ^kV:, *v-:^ <400> 22 v,. - • . . :•* . ;:;":*.. . .*.%'•: 
V " ';.- V, atgaaggcga aggcgtcc ^® ' ■ ' 

<210> 23 

' ' <211> 16 . : 

<212> DNA 

<213> Hepatitis C Virus 

<400> 23 
ggaagaagcc tgtaag 



<210> 24 
<211> 24 
<212> DNA 

<213> Hepatitis C Virus 

.<400> 24 
gaacctatcc agcaaggccg ttaa 

<210> 25 
<211> 19 
<212> DUA 

<213> Hepatitis C Virus 

<400> 25 
accagagaaa ggaggccgc . 

<210> 26 
<211> 14 
<212> DNA 

<213> Hepatitis C Virus 

<400> 26 
acccagactt gggg 



<210> 27 
<211> 20 
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•;i6 / 20.: : ./ i T .* • . w.' 



<212> DNA 

<213> Hepatitis C Virus 

<400> 27 
tctccaccct tcctcaggct 

<210> 28 . 



<213> Hepatitis C Virus 



I 



cgagttcctg gtgaatgcc V/^yi 



<210> 29 



<211> 24 

<212> DNA ' : ;V . ' 

<213> Hepatitis C Virus 



<400> 29 
tgccctatgg gcttcgcata. tgac 



scat, tjic'v ::, V , i'fi'iV : i-' ■ ISSlK ''©f 



<210> 30 
<211> 25 
<212> DNA 
<213> Hepatitis C Virus 

<400> 30 

tttcgactca acggtcaccg agaat . 25 

<210> 31 
<211> 15 
<212> DNA 

<213> Hepatitis C Virus 



<400> 31 
gttgaggagt caatt 



15 



<210> 32 

<211> 20 ^ ;•. ' • 

<212> DNA " ■ -V .'iy' . '* . v). / : 

<213> Hepatitis C' Virus V 

<400> 32 

ttggcccccg aagccagaca i . .^^ 

<210> 33 • 
<211> 17 

<2i2> DNA . .:r ^.s. v*--\:.-. 

<213> Hepatitis C Virus ' ; '! U^l' . 



; 17 



<400> 33 • \ 

aaggtcgctt acagagc V-,*;- ' 

<210> 34 . ■ V ;■ : ' ; " V;.V:''. = ^ • ''.v' .'• ' \ 

• <2ii> 17 • ' ■ • .■ '•• ••';V- ■ 

<2i2> DNA : • / ' ^ • ^- • i ^^.^ 

<213> Hepatitis C virus ' V'--'''\.' '^^^V''' \H ^ ' V' 

<400> 34 . . . 
atcgggggtc ccctgac ,17. 



W099/51781 ^/V- . '-V/ on "^V ' ' ^i^K-.'-'^^^.- ^ ^^^P<F^S99/^ 



<211> 18 . ■ ' ' 

<212> DNA ■] . ...... j-:^ . -. -.v.., 

" <213> Hepatitis C Virus - V" * , * . • ^- ' VV/*:' '«* ^-^ ' V^/^^^'- ^•^^'^^V*^ 



- . / - " • * • 



<400> 35 .'/.'./. , ■" , 

taactcaaaa gggcagag -V'-' V^.; --V-r .-' ' V ' * r.'-^^ -.Vv!'' : V-'^V]^i.*'!\* ^'V ^ 



<213> Hepatitis C>irus / '^^ ^-y-!^^'' ;V/':>i^^ : ■ -^/^^ ^.M:j>'/^^i^i^^ 

<400> 36 '-'l: /• ' V '/'^y'^-^ Z^""^-- -'v'' * '.V:-: • 

, atgttacttg aaggcctct y v • oVv- ■^^X^^'^'-" .-■'^ ^ --v U : i^r -^^ /^W.-"- 



<2X0> 37 
<211> 25 
<212> DNA 

<213> Hepatitis C Virus ; '--V \;;-m.! 



>V>- gatgcttgtg tgcggagacg acctc ^/ ^ ; 5;? ^ ^ -/^^U^4v^^)•:j■^^ 

; ^ . . v='v- 



gatgcttgtg tgcggagacg acctc 
<210> 38 

<211> 31 

■; ' <212> DNA 

. ^ <213> Hepatitis G Virus 



<400> 38 

ggtcgcgcac gatgcatctg gcaaaagggt a 31 

<210> 39 
<211> 19 
<212> DNA 

<213> Hepatitis C Virus 



19 



<400> 39 

caccacccct cttgcgcgg .\. 

<210> 40 ■• './\: • ;V:\ V^. 

. • ■ <2ii> 25 ■ ;■ ' .:'Vvv.". -S-.-'. ■ 

<212> DNA ■■■ . \\ ' v • ' . Z:^-.: ' l- 

<213> Hepatitis C Virus ': 

■ ' /• ' ' ■ ' ' h. - v.^'^V ' • " ' '\ ' 

<400> 40 . ; V rr-:^ . ;Vi' ■ * ^^ 

ctccatcctt ctagctcagg agcaa . . 25 

<2io> 41 ^ ' ■ . ' "^vV/v/^-" -'f-v'' ' ' ^-y-'^':?. ' '*.:•:* 

■ ' <2ii> 20 - : ' V. ' ■ 'V^ ' ■ V^''} ■:• 

<212> DNA. . .'"'V ^ ..' \. ■ : 

<213> Hepatitis C Virus .. '•'^'.^^ 



<4b0> 41 ■ 

agttactgtc ccaggggggg 

<210> 42 . . 
<2ll> 17 . , 

<212> DNA 

<213> Hepatitis C Virus 



1 
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<400> 42 
tccggctgcg tcccagt 

<210> 43 
<211> 26 
<212> DNA 

<213> Hepatitis C Virus 



PCTAJS99/07404 
17 



<400> 43 
ccgaccccgc tggttcatgt ^gtgcc 

<210> 44 
<211> 19 
<212> DNA 

<213> Hepatitis C Virus /' 

<400> 44 
ctacctgctc ccgaaccga 

<210> 45 . 
<211> 19 
<212> DNA 

<213> Hepatitis C Virus 

<400> 45 
ctacctgctc cccaaccga 

<210> 46 
<211> 80 
<212> PRT 

<213> cHepatltis C Virus 



<400> 46 
Ser Met Ser Tyr 'Chr Trp Thr Gly 

1 5 
Glu Glu Ser Lys Leu Pro lie Asn 
20 

His His Asn Met Val Tyr Ala Thr 

35 40 
Gin Lys Lys Val Thr Phe Asp Arg 

50 55 
Arg Asp Val Leu Lys Glu Met Lys 
65 70 



Ala Leu lie Thr Pro Cys Ala Ala 

10 15 
Ala Leu Ser Asn Ser Leu Leu Arg 
25 30 
Thr Ser Ttrg Ser Ala Ser Gin Arg 
45 

Leu Gin Val Leu Asp Asp His Tyr 
60 

Ala Lys Ala Ser Thr Val Lys Ala 
75 80 



<210> 47 
<211> 47 
<212> PRT 

<213> cHepatitis C Virus 
<400> 47 

Ser Val Glu Glu Ala Cys Lys Leu Thr Pro Pro His Ser Ala Arg Ser 

15 10 15 

Lys Phe Gly Tyr Gly Ala Lys Asp Val Arg Asn Leu Ser Ser Lys Ala 

20 25 30 

Val Asn His lie His Ser Val Trp Lys Asp Leu Leu Glu Asp Thr 
35 40 45 

<210> 48 
<211> 40 
<212> PRT 

<213> CHepatitis C Virus 



W0 99/5178t 



19 / 20 
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<400> 48 

Glu Thr Pro lie Asp Thr Thr He Met Ala Lys Asn Glu Val Phe Cys 

1 5 10 15 

Val Gin Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu He Val Tyr 

20 25 30 

Pro Asp Leu Gly Val Arg Val Cys 
35 40 

<210> 49 
<211> 102 
<212> PRT 

<213> cHepatitis C Virus 
<400> 49 

Ser Ser Tyr Gly Phe Gin Tyr Ser Pro Gly Gin Arg Val Glu Phe Leu 

15 10 15 

Val Asn Ala Trp Lys Ser Lys Lys Cys Pro Met Gly Phe Ala Tyr Asp 

20 25 30 

Thr Arg Cys Phe Asp Ser Thr Val Thr Glu Asn Asp He Arg Val Glu 

35 40 45 

Glu Ser He Tyr Gin Cys Cys Asp Leu Ala Pro Glu Ala Arg Gin Ala 

50 55 60 

He Arg Ser Leu Thr Glu Arg Leu Tyr He Gly Gly Pro Leu Thr Asn 
65 70 75 80 

Ser Lys Gly Gin Ser Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val 

85 90 95 

Leu Thr Thr Ser Cys Gly 
100 

<210> 50 
<211> 56 
<212> PRT 

<213> cHepatitis C Virus 
<400> 50 

Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Cys Glu Ser 

15 10 15 

Ala Gly Thr Gin Glu Asp Ala Ala Ser Leu Arg Val Phe Thr Glu Ala 

20 25 30 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro Glu Tyr 

35 40 45 

Asp Leu Glu Leu He Thr Ser Cys 
50 55 

<210> 51 
<211> 37 
<212> PRT 

<213> CHepatitis C Virus 
<400> 51 

Ser Val Ala His Asp Ala Ser Gly Lys Arg Val Tyr Tyr Leu Thr Arg 

15 10 15 

Asp Pro Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His 

20 25 30 

Thr Pro Val Asn Ser " 
35 

<210> 52 
<211> 29 
<212> PRT 

<213> CHepatitis C Virus 
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<400> 52 

Cys Le^ Arg Lys Leu Gly Val Pro Pro Leu Arg Val Trp Arg His Arg 

15 10 15 

Ala Arg Ser Val Arg Ala Lys Leu Leu Ser Gin Gly Gly 
20 25 

<210> 53 
<211> 25 
<212> PRT 

<213> Hepatitis G Virus 
<400> 53 

Phe Val Ala Gly Tyr Ser Gly Gly Asp lie Tyr His Ser Leu Ser Arg 

1 5 10 15 

Ala Arg Pro Arg Trp Phe Met Trp Cys 
20 25 
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BOX XL OBSERVATIONS WHERE UNITY OF INVENTION WAS LACKING 
This ISA found multiple mveotioas as foDows: 
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inventive oonixi* under PCT Rule 13.1. In ofder for afl inventions to be ae«n*ed. the appn>priatc additional sear^ 
. fees must be piid. 

Gnnq) I. claim(s)l-15. 18. 19, 29, 31, 33, 35, 3W1, 90. 100. dimwn to nueleie aeid encoding a hepatitis C vims 

(HCV) NSSB piotBin, variants , mutants and derivatives, and host ceUs. 

Group n. claim(8) 16, 17. 20-26, 30, 32, 34, 36. 42, drawn to hepstitis C vin» proteins. 

Groiq> ni(aHn), claim(8) 27, 28, 44, drawn to hepatitis V virus proteins, altefBtms of the proteins and various 

mutants. / 

Group IV(aHk), claim(s)37, 43, 81, 91, 92, 93, 94^, drawn to mutants of various nucleic acids of hepatitis C virus 
plus variants and multiple altered derivatives. 

Group V, claim(s) 45-50, drawn to mcdtod of assaying a test compound for modulating activity against HCV. 

Group VI, daim(s) 51-55, drawn to method of assaying a compound for antagonistic ai^ivity. 

Groiqi Vn, claim{8)56-60, drawn to method of assaying a compouml for agonistic art 
Group Vm, Glaim(8) 61-65, drtwo to method of assaying a test con^wund for inteiBction wfth HCV protein. 
Group IX, dBtni(8) 66-67, drawn to method of tnteiaction with HCV nudete acid. 

Group X, cfaum(8)6g-71, diawn to method of detecting HCV in a biological sample comprising nudric add 
amplilication. 

Gmsp XI , daim(8) 72-75, diawn to method for detecting interactions between viral polypeptide and aniibocfos. 
Group XII(aHh), claim(s) 76-79, drawn to antibodies. 

Group Xin. cUim<8)80, drawn to antibody with specific aflinity to polypeptide oonqirising in Table 2. 
Group XIV, daim(8) 82, 87, drawn to method of introdudng a nucleic acid uito a hosL 
Group XV, daim(B) 83-86, drawn to viral antigens. 
Group XVI, chum(8)88, dxawn to method of inducing an immune response. 
Group XVII, daimCs) 89, drawn to a method of amidifymg HCV nucldc acids. 
Group XVni, claim(8) 98, 99, drnwn to method of assessing the functionality of HCV NS5B protein. 
Group XDC, ciaim<8)101, drawn to a method of assaymg a test compound for anUgonist against HCV in a ceU line. 
Group XX, claim(s) 102, drawn to a method of assaying a test compound for agonist activity HCV in a cell line. 
Group XXI, cbim(8) 103-110. drawn to a method of preparing a HCV NS5B protein. 
Group XXn, elaim(8)lll, dmwn to a method of isolating an NS5B protein. 

Qtoap XXm, dum(8) 112, 113, 119, 120, drawn to a method of generating an infectious viral vector. 
Group XXIV, cfaum(8) 114, 115. dmwn to & method of generating an infectious viial vector comprising substituting 
HCV NSSB bomologous gene, aad modified gpnes as set forth in Table 6. 

Group XXV, c]aim(s)116, dnwn to a host animal. 

Group XXVI, daim(s) 117, drawn to a method of isolating antibodies by screening antibody library. 
Group XXVn, claim(s) 118, drawn to a method of propagating HCV in a in vitro system utilizing SEQ ID NO: 1-7. 
Group XXVIU, claim(8)121-124, drawn to method of propagating HCV in an in vitro system idilizing SEQ ID NO: 2, 
8-13. 

Group XXIX, clum(s) 125-128, drawn to a method of propagating HCV in a living host 
Group XXX, claim(s) 129, 130, drawn to a kk for detecting the HCV nucleic acids. 
Group XXXI, claim(8)13M32, drawn to a tit for detecting HCV utiiizmg aniibodiea. 



This applicatkm^oodtains datms directed to more than one species of the generic invention. These species are deemed 
to lack of Inveirtion because th^ are not so linked as to form a single mvcotive concept under PCT Rule 13.1. 
In order for more than one ^ies to be searched, the appropriate additional seareh fees must be pud. The species are 
as follows: Nucleic adds moleeule as act forth m Tabic 6 (species l-ll)as it applies to daims of Group IV. 
Polypeptide of Table 4, 5 (total species 1-I4)as they apply to claims of Group II!. Antibodies for polypeptide of Table 2 
(1-8). For explanation see bdow. 
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The ioventioiis listed as Groups I-XXXl do not idate to a smgfe inventive concept under PCX Rule 13.1 because, under 
PCT Rule 13.2, Ih^ lack the sanie or c or rtap on d ing specia] leehnical features for the following reasons: The uiveotioti 
of Group I does not make a contffliution over the prior sit as evidence by Lohnuuu) et al (Journal of Virology^ Nov. 
1997, VoL 71. No. 11, p. 8416-8428), also as evidence by Miyamura et al (US Patent No. 5,372,928. Dee. 1994), and 
Bdueos et al (The EMBO louraal, 1996. VoL 15. No. 1. p. 12-22). Since the nuehae actd molecule of hepatitis C 
vims NS5B is already known in the ait thiis the invention lacks unity of invention as defined by PCT Rule 13.2. The . 
cited refenaioes prove that the technical feature of Group I does not make a contribution over the prior ait. The claims 
are not so Vtn^f^ by a spedal »«*^Km<>«i feature within the meaning of PCT Rule 13.2 so as to form a single inventive 
concept, accordin^y, the unity of invention is laddng among all groups. 

The tpeaet listed above do not retate to a single mvcntive concept under PCT Rule 13.1 because, under PCT Rule 
13.2, the ip mfT lack the ssme or concipun ding special tr^ni^' feitures for the following reasons: Each mutations 
Issted oonfeiB WaaA structure and prenmiably differeoi effect on antigenicity and virulence of the virus. Since die 
effects of m"*»«^««M are unpretfietaUe, a mutatkm of one facation does not leach or suggests mutation at a different 
loeation. Therrfore, each location is dUict species. 
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